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Lecture 1 


Binary Quadratic Forms 


1.1 The theory of modular form originates from the work of C.F. Gauss of 1831 
in which he gave a geometrical interpretation of some basic notions of number 
theory. 











Let us start with choosing two non-proportional vectors in R? 





v=(a,b), w=(c,d). 


The set of vectors 











A = Zv + Zw := {mv + mow € R?| m,m € Z} 














forms a lattice in R?, i.e., a free subgroup of rank 2 of the additive group of the 
vector space R?. We picture it as follows: 
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The area A(v, w) of the parallelogram formed by the vectors v and w is given 
by the formula 


A(v, w)? = det |. 


VW WW 


Let x = mv + nw € A. The length of x is given by the formula 


Vv: Ww WW 


Ixl? = ||mv + nw]|? = (m,n) Gee ~ (e) = am? + 2bmn + en?, 


where 
a=v-v, b=v-w, c=Ww-w. (1.1) 
Let us consider the (binary) quadratic form (the distance quadratic form of A) 
f = ax? + 2bry + cy’. 
Notice that its discriminant satisfies 


D = 4(b? — ac) = —4A(v,w)? < 0. (1.2) 


Thus f is positive definite. Given a positive integer N, one may ask about integral 
solutions of the equation 


f(x,y) =N. 


If there is an integral solution (m, n) of this equation, we say that the binary form 
f represents the number n. Geometrically, this means that the circle of radius y/n 
centered at the origin contains one of the points x = mv + nw of the lattice A. 
Notice that the solution of this problem depends only on the lattice A but not on 
the form f. In other words, if we choose another basis (v’, w’) of the lattice A, 
then the corresponding quadratic form 


f' —2 d'z? + 2b'zy + cy’, 
where a’ = v’-v’, b = v'-w', œ = w- w has the same set of integral 
solutions for the equation 
f'(z,y) =N. 
Let 
vi =av+ yw, v =6v+ow. 


for some a, 0, y,6 € Z. Since the matrix 
a 
y ô 
is invertible in the ring of integral matrices, we must have 


det M = aô — py = +1. 


It is easy to see that 
Trd 1 Ba 1 . . 
K E a _ wut h v v w)m 
vew wew vew WW 
ad bV\ f(a y\fa “BN foie 8 
de d} \B OME? dj \y 8j’ 


This can be also expressed by saying that the form f’ is obtained from the form f 
by using the change of variables 


and hence 


xz —> azt + By, yoryet oy. 


We write this in the form 

f =Mf. 
Following Lagrange, we say f and f’ are equivalent. An equivalence class is 
called a class of quadratic forms. Obviously, for any positive integer N, the set 
of integral solutions of the equations f(x,y) = N depends only on the class of 
forms to which f belongs. Also it is clear that two equivalent forms have the same 
discriminant. 
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1.2 As we saw before any lattice A determines a class of forms expressing the 
distance from a point in A to the origin. Conversely, given a positive definite 
binary form f = ax? + 2bry + cy”, we can find a lattice A corresponding to this 
form. To do this we choose any vector v of length \/a and let w be the vector 
of length \/c which forms the positive angle with v defined by cos ọ = b/,/ac. 
Obviously we use here that f is positive definite. Of course, A is defined uniquely 
up to an orthogonal transformation of R?. 
In this way we obtain the following: 























Theorem 1.1. There is a natural bijection between the set of lattices in R? modulo 
an orthogonal transformation and the set of classes of positive definite quadratic 
forms. 





Let us describe the set of classes of forms in a more explicit way. 


Theorem 1.2. Let f be a positive definite binary form. Then there exists a form 
g = Av? + 2Bry + Cy? equivalent to f which satisfies the conditions: 


{0<2B<A<C}. 


Proof. Let f = ax? + 2bry+cy? and A be a lattice associated to it. Let us change 
the basis of A in such way that the corresponding form 


g = ||v'Pax* + 2v! - w'zy + w'y? 


satisfies the assertion of the theorem. We take v’ to be a vector from A of smallest 
length ya. Then we take w’ to be a vector of smallest length in A which is 
not equal to +v’. I claim that (v’, w’) forms a basis of A. Assume it is false. 
Then there exists a vector x € A such that x = av’ + bw’, where one of the 
coefficients a,b is a real number but not an integer. After adding some integral 
linear combination of v’, w’ we can assume that |a|, |b] < 5. If a,b + 0, this gives 


1 
ixl? = lal liv? + blw? + 2abv! -w < (alliv' li + [elle ID? < liw? 
that contradicts the choice of w’. Here we have used the Cauchy-Schwarz in- 


equality together with the fact that the vectors v’ and w’ are not proportional. If a 
or b is zero, we get ||x|| = $||v’|| or |||] = || w’||, again a contradiction. 


Now let us look at g. The projection of v’ + mw’ to w’ is equal to (m + 
ns g proj q 
we . We can choose m to get that the length of the projection is less than or 
equal than 4. However, the shortest projection corresponds to the shortest vector. 
q 3 proj P 


Taai x 
yw < 1. It remains to change v’ 
liv” 2 


to —v’, if needed, to assume that b = v- w’ > 0, hence 0 < 2b < a. 





By our choice if w’, we must have —+ <b= 














Definition. A positive definite binary quadratic form ax? + 2bry + cy? is called 
reduced if 
0<2b<a<e. 


The previous theorem says that each positive definite binary quadratic form is 
equivalent to a reduced form. 
Let 














Q = {(a,b,c) ER? :0< 2b<a<ca>O0,ac> b*}. (1.3) 


By Theorem 1.2, any positive definite binary quadratic form is equivalent to a 
form ax? + 2bry + cy”, where (a, b,c) € Q. 


1.3 Let us find when two reduced forms are equivalent. To do this we should 
look at the domain Q from a different angle. Each positive definite quadratic form 
f = ax? + 2bry + cy? can be factored over C into product of linear forms: 


f = az? + 2bry + cy’ = a(x — zy) (x — Zy), 


where 
= — h2 
M E OEE, (1.4) 


a a 





It is clear that f is completely determined by the coefficient a and the root z. 
Observe that Im z > 0. We have a bijective correspondence 





f = ax? + 2bry + cy? > (a, z) 











from the set of positive definite binary quadratic forms to the set R} x H, where 





H={zeEC:Imz>0} 


is the upper half-plane. Let us see how the group GL(2, Z) acts on the both sets. 
We have 
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Mf =a((ax + By) — z(yax + dy))((ax + By) — Z(yx + dy)) = 


a(x(a— yz) — y(—8 + 46z))(x(a — yz) — y(—8 + 62)) = 
—p + ôz —B+6z 
= i= y). 


a— yz a- Zz 
Let us consider the action of GL(2, Z) on C \ R by fractional-linear transforma- 
tions (also called Moebius transformations) defined by the formula 





ala — zy’ (s — 





























a B az+ 8 
igre : 1. 
(; 5) oye +6 om 
Notice that 
az +B (az+B)zZ+46)  ad— By 
Im M-z=I1 = = Im z. 1. 
MoS M gza petap harap Oo 














This explains why the transformation is well-defined on C \ R. Also notice that 


Mt -aem ( £ m: 


=y Q 


Thus the root z is transformed to the root z’ = M~! - z and we obtain, for any 
M e GL(2, Z), 


M. f = ajyz + ô| (x — M - z)(x — M - Z). 


1.4 Until now we considered binary forms up to the equivalence defined by an 
invertible integral substitution of the variables. We say that two binary forms are 
properly equivalent if they differ by a substitution with determinant equal to 1. In 
other words, we restrict ourselves with with the subgroup SL(2, Z) of GL(2, Z). 


Since 
1 0 


GL(2,Z) = SL(2,Z) U € 2) SL(2, Z) 
1 0 2 2 2 2 . . 
and 0-1 (ax? + 2bxry + cy*) = az’ — 2bxy + cy* we obtain that each f is 
properly equivalent to a form ax? + 2bxry + cy”, where (a, b,c) € Q and 


Q = {(a,6,c) € R? : |26]| < c < a,a, ac — b? > 0}. 














Definition. We shall say that f = ax? + 2bxy + cy? is properly reduced if 
(a,b,c) EQ. 


Since 
0 1 


GL(2, Z) = SL(2, Z) U (; s) SL(2, Z) 
and (? 4) corresponds to the switch of the basis vectors v, w of the lattice, we 
obtain 


Theorem 1.3. There is a natural bijective correspondence between proper 
equivalence classes of positive definite binary forms and lattices in R? modulo 
rotation transformation. 























Let Qf be the set of positive definite binary quadratic forms on R?. The 
group SL(2, Z) of integral unimodular invertible matrices acts naturally on QF by 
f ~ M~'f. The map OF — R, x H defined in above is SL(2, Z)-equivariant if 
we let SL(2, Z) act on the target by 


(a, z) => (alyz + ô|, M - z). 

















Note that we have restricted ourselves to the subgroup SL(2, Z) in order to have 
ImM.z>0. 


Using (1.1) we see that the conditions 0 < |2b| < a < c correspond to the 
conditions 


1 1 
=p REGS ah Imz > 0. 


Let D be the subset of the upper-half planes described by the above inequalities. 
It is called the modular figure and looks as follows: 


fY 


D 
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So we have a bijective correspondence between 2 and R, x D. 


Now suppose f, f’ € Q and M~!. f = f' for some M € SL(2, Z). Replacing 
(f, M) with (M - f, M~*), if needed, we may assume that Im M - z > Im z. 


The formula (1.3) implies that |yz + ô| < 1, where M = Al This gives 


ô 
y € {0,1,—1}. 

Assume y = 0. Then the Moebius transformation defined by M~t is the 
translation z > z+ a and hence takes z out of the domain —} < Re z < 4 unless 
B = Qor p = +1 and Rez = +3. In the first case M = +/ and f = f’. In the 

1 +1 
second case M = + 0 1 J f = ax? + axy + cy? and f' = ax? F ary + cy’. 
Assume y = +1. If y = 1, then |z + ô| < 1 implies 


G) ô= 0, |z| = 1, or 


Gi) z = p := =Y and ô = 1. 





Q 


In case (i) we have M = + ({ 


ao and M-z=a- z, This easily implies 


a = 0 or (a, z) = (—1, p), (1, —p?). So, in the first case, M = fe J] ane 


M - f = cx? — 2bry + ay”. Since (c,b,a) € Q, we get a = c. Again f is of the 
form ax? + 2bry + ay? and is properly equivalent to ax? — 2bry + ay’. 

In the second case f = a(x? + xy + y?) and Mf = a(x? — ry + y’). 

—1 

ee ) and M-p = (ap+(a-1))/(o+ 
1) =a + p. This implies a = 0, f = a(x? + zy +y’), Mf =f. 

Finally, the case y = —1 is reduced to the case y = 1 by replacing M with 
—M. 

This analysis proves the following: 





Now, in case (ii), we get M = 





Theorem 1.4. Let f = ax? + 2bry + cy? and f' = a'x? + 2b'zy + cy? be two 
properly reduced positive definite binary forms. Then f is properly equivalent to 
f' if and only if f = f' or f = ax? + azy + cy’, f! = ax? F axy + cy’, or 
f = ax? + 2bry + ay’, f! = ax? — 2baxy + ay”. Moreover, Mf = f for some 
M ¢ +1 if and only if one of the following cases occurs: 








(i) f =a(a? +y’) and M = (4 ay 


(ii) f = a(x? + zy + y’) ond M = (7 eG ul 


Definition. Let G be a group acting on a set X. A subset S of X is called a 
fundamental domain for the action of G on X if each orbit of G intersects S at 
exactly one element. 


The proof of Theorem 1.4 shows this enlarged set contains a representative 
of each orbit of SL(2, Z). Moreover, two points (a, b, c) and (a’, b’, c’) in Q belong 
to the same orbit of SL(2, Z) if and only if either a = c = a = c,b = —b' or 
a' = a,b' = —b = a/2. Clearly 














Q=R,xD. 


To get the fundamental domain for the action of SL(2, Z) on Q7 we have to con- 
sider the subset 1’ of 2 defined by the following inequalities: 


Q= {(a,b,c) E02: |2b]) <a<c or a=c>2b>0 or a=2b> 0}. 


The corresponding subset of the modular figure is obtained by deleting from it the 
vertical line Re z = 1/2 and the part of the unit circle where the argument is less 
than 7/2. 


Since we do not need we leave it to the reader to state an analog of Theorem 
1.3 for reduced (but not properly reduced) forms and find a fundmanetal domain 
for action of GL(2, Z) on QF. 


1.5 Theorem 1.4 has a nice application to number theory. 


Definition. A binary quadratic form ax? + 2bry + cy? is called integral if a, 2b, c 
are integers. It is called primitive if (a, 2b,c) = 1. 


Corollary 1.1. . The set of reduced integral positive definite binary forms with 
fixed discriminant D = 4d is finite. 


Proof. If we fix the discriminant D = 4d = 4(b* — ac), then there are only finitely 
many points in the domain Q whose coordinates are integers. 














Definition. We say that two integral positive definite binary forms are in the same 
class if they are properly equivalent. 


Corollary 1.2. The set of classes of primitive integral positive definite binary 
forms with the same discriminant is finite. 
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Exercises 











1.1 Let A be a lattice in R?. Show that the number of vertices of shortest distance 
from the origin can be equal only to 2, 4 or 6. Find the lattices with 4 and 6 shortest 
distance points. 























1.2 Show that any subgroup of R? which is a discrete set (i.e. each ball in R? 
contains only finitely many elements of the set) is a free abelian subgroup of rank 
at most 2. 


1.3 Let A be a lattice in R?. Let us identify R? with C in the usual way. Consider 
the set O, of complex numbers z such that z- A C A. 
































(i) Show that Oy, is a subring of C and A is a module over Oy; 


(ii) Show that Oa = Z unless there exists c € C such that cA is contained in 
some imaginary quadratic extension Q(./—d) of Q. 


1.4 We say that a lattice A admits a complex multiplication if the ring O, defined 
in the previous exercise is different from Z. Assume that A satisfies this property. 
Prove the following assertions: 


(i) the field of quotients K of O, is a quadratic extension of Q which contains 
A; 


(ii) a distance quadratic form of A is proportional to an integral quadratic form; 


(iii) the quadratic field K is equal to Q(v b? — ac); 





a ea 2 — AD 


(iv) the ring O, is generated over Z by 1 and fw, where w = = 


if d = 1 mod (4), and w = /—d, f*d = D otherwise. 


1.5 Let K = Q(\/—d) be an imaginary quadratic field. We shall assume that d is 
a square free integer. We say that two lattices A and A’ contained in K are similar 
if A’ = aA for some a € K. 


(i) Find a natural bijective correspondence between the similarity classes of 
lattices contained in K and the proper equivalence classes of primitive inte- 
gral positive definite binary forms ax? + 2bxry + cy? which decompose into 
the product of linear forms over K and whose discriminant D = 4(ac — b°) 
is equal to the square of the volume of the fundamental parallelogram of the 
corresponding lattice. 
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(ii) Let ax? +2bry + cy? represents a class of primitive integral positive definite 
binary forms corresponding to the lattice A with complex multiplication 
defined by the ring O,. Show that a and —2+2v’"—ae vb" ae generate a proper ideal 
in O Ae 


1.6 Let A and A’ be two lattices admitting complex multiplication with O, = Ow = O. 
Define A - A’ as the subgroup of C generated by the products AX, A € A,X’ € A’. 


(i) Show that A - A’ is a lattice A” with Oar = O; 


(ii) Show that the operation of product of lattices defined in part (i) is compatible with 
the similarity relation and defines the structure of a finite abelian group on the set 
of similarity classes of lattices A with the same ring O4. 


1.7 Using the previous exercises define the structure of an abelian group on the set C(D) 
of proper equivalence classes of primitive integral positive definite binary forms of given 
discriminant D. 


(i) Compute the product of two forms ax? + cy? and a'z? + cy? with ac = a'd. 


(ii) Show that the class of the form x? + ny? (resp. 2? + zy + ny?) is the unit of the 
group C(D) if D = 4n (resp. if D = 1 + 4n). 


(iii) Show that the class of ax?—bry+cy? is the opposite of the class of ax? +bry+cy?. 


1.8 Using Exercise 1.5 (ii) show that there is a natural isomorphism between the group 
of similarity classes of lattices with complex multiplication defined by a ring O and the 
group C(O) of ideal classes of O. 


1.9 Find all reduced primitive integral positive definite quadratic binary forms with dis- 
criminant D = —4, —8, —12, —20, —56. Compute the number h(D) of classes of primi- 
tive integral positive definite quadratic binary forms for these values of D. 





1.10 Show that h(—4n) > 1 if n is not a prime number. 
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Lecture 2 


Complex Tori 


2.1 As we saw in the previous lecture there is a natural bijection between the set Q 
of proper equivalence classes of positive definite binary quadratic forms and the product 
R+ x D’, where D’ is the subset of the modular figure D whose complement consists 
of points 5 + iy and e?,0<¢< 1/2. The factor R4 corresponds to the first coef- 
ficient a of the form f = ax? + 2bry + cy”. Now recall that the set of equivalence 
classes of positive definite binary quadratic formis also bijective to the set of lattices in 
R? modulo orthogonal transformation. The set of proper equivalence classes of positive 
definite binary quadratic forms corresponds to the set of lattices modulo rotation trans- 
formations. Now to get rid of the factor R let us consider lattices equivalent if one is 
obtained from another by multiplying with a nonzero complex number 4, i.e. A ~ A’ if 
A’ = {\v|v € A}. Since each complex number can be written in the form re’? we see 
that we allow, additionally to rotations, positive scalar dilations of lattices. If v, w is a 
basis of A, then Av, Aw is a basis of AA. In particular, if A = r is real positive, we get 
that the corresponding quadratic form f = ||v||?2? + 2v - way + ||w||?y? is multiplied 
by r?. Thus, we may always assume that ||v||? = 1, hence the equivalence class of A is 
determined by one root z € H of the quadratic form f modulo Moebius transformations. 
Thus we obtain 


Theorem 2.1. There is a natural bijection between the set of equivalence classes of lat- 
tices in R? and the subset D' of the modular figure D. 


Now let us find another interpretation of elements from D, this time as isomorphism 
classes of elliptic curves. 


Let A be a lattice in R?. Consider the orbit space 
E =R?/A. 
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One can choose a representative of each orbit in the fundamental parallelogram 
II = {ev + yw)0 < z,y < 1}, 


where v, w is a basis of A. In this parallelogram two points belong to the same orbit if and 
only if they differ by v or w. So, if we identify the opposite sides of II, we get a bijective 
map from II onto Æ. Topologically, Æ is homeomorphic to the torus, or the product of 
two circles. In fact, as a topological group, 


R?/A & R?/Z? S (R/Z) x (R/Z) = St x St. 


However, we can do more; we put a structure of a complex manifold on E which will 
depend only on the equivalence class of A. 

Before we do it let me recall some basics about complex manifolds. Let X be a 
topological space. A geometric structure on X is defined by assigning to any open subset 
U of X a certain ring O(U). Its elements will be interpreted as functions on U. This 
assignment satisfies the following property: 


(i) if V C U then there is a unique homomorphism of rings ry/y : O(U) > O(V) 
such that ryyu ° ru/v = Tw/v Whenever V CU CW. 


We would like to interpret elements of O(U) as functions on U and the homomor- 
phism ryvy is as the restriction of functions on U to the subset V. In order to do this, 
we shall require an additional property. Let x be a point of X. Consider the following 
equivalence relation on the union of rings O(U) where U runs through the set of open 
neighborhoods of x. Let f € O(U),g € O(V). We say that f ~ g if there exists an open 
neighborhood W of x contained in U N V such that ry;w(f) = rv;w(g). Denote the 
set of equivalence classes by O,. There is a natural structure of a ring on O, such that 
for any U containing x the canonical map O(U) — Ox is a homomorphism of rings. We 
require 


(ii) For each x € X the ring O; is a local ring, i.e. contains a unique maximal ideal. 


Let my denotes the unique maximal ideal of O, and k(x) = O,/m,. This is a 
field. For any open neighborhood U of x there is a canonical homomorphism of rings 
O(U) > O; — k(x) the image of f € O(U) in K(x) is called the value of f at x and 
is denoted by f(x). In this way each f € O(U) can be considered as a function on U, 
although at each point x of U the value of f at x may belong to a different field. Of course, 
we can consider the common set of values by taking the union of all fields x(x). In many 
special cases, each ring O(U) is equipped with a structure of an algebra over a field k 
and the restriction homomorphisms are k-algebra homomorphisms. In this case we may 
consider k as a subring of O(U); its elements are called constant functions. If are lucky 
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the residue homomorphisms O(U) — K(x) induce an isomorphism of fields k + K(x). 
In this case we may consider the value of any function on U as an element of the same 
field k. 

A topological space X together with a collection Ox of the rings Ox (U) satisfying 
the previous conditions (i) and (ii) is called a geometric space. The collection Ox is 
called the structure sheaf of the geometric space. 

An example of a geometric structure on X is obtained by taking Ox (U) the ring of 
continuous real functions on U. 

Obviously, a geometric structure Ox on X equips each open subset U C X with the 
restricted geometric structure. We shall denote it by Oy. A continuous map f : X > Y 
of geometric spaces is called a morphism of geometric spaces if for any open subset 
U C Y there is a homomorphism of rings fË : Oy (U) + Ox(f71(U)) satisfying the 
following properties: 


(i) for any V C U the following diagram is commutative: 


fË 

Oy (U) —— Ox(f-(U)) 

ruv | [momo 
# 


Gi) Let f(x) = y and let fiz : (Oy)y —> (Ox) be defined as follows. Take a 
representative ¢ € Oy (U) of 6 € (Oy), and define JÉ x(@) to be the equivalence 
class of bi (ġ) in (Oy )s. It is easy to see that this is wel-defined. We require that 
fi z maps my to my. 


One interprets the homomorphism fË as the composition of a function on U with the 
map f : f-!(U) — U. In fact, for each x € X with f(x) = y the homomorphism fis 
induces a homomorphism of fields ie : K(y) > K(x) such that, for any ¢ € Oy (U), y € 
U, 

f*(U)($)(2) = Fl (2))) 


So, a morphism of geometric spaces is a continuous map f : X — Y which transforms 
functions on Y to functions on X. 


We leave to the reader to define compositions of morphisms of geometric space and 
to show that the identity map X — X is a morphism of geometric spaces. This will 
define a category of geometric spaces. The notion of isomorphism of geometric spaces is 
immediate: it is a morphism of geometric spaces which admits the inverse. 


To define a geometric structure on X one need not to define O(U) for all U; it suffices 
to do it only for an open set in a base {U;};e7 of the topology. Then for any open U we 
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set 


OW) = lim O(U;) 
UCU 


Here we use the definition of the projective limit: the subset of the product Į J;e; O(Ui) 
which consists of strings (...,@j,...,@;,---) such that ry, jy, (ai) = Ty,/u,(a;) when- 
ever Ug C U; N U}. 

We will be mainly concern with an example of a complex structure. Let us define 
it. Let X = C” equipped with its standard topology defined by the Euclidean metric 
\|z|| = (a|? +... + |zn|?)!/?. We define a complex structure on X by assigning to 
each open ball U, (a) with center at a and radius r the ring O(U;-(a)) of complex valued 
functions on U,(a) which admit an expansion 


f(z) = se Oi, in (21 — 41)" ... (Zp — an)” 


tl, tn 20 


absolutely convergent in U, (a). A complex valued function on an open set U belongs to 
O(U) if and only if for any point a € U there exists a ball U, (a) contained in U such that 
the restriction of f to it belongs to O(U,(a)). Such functions are called complex analytic 
or holomorphic functions on U. A non-trivial result from complex analysis says that a 
function f = u + iv : U — Cis holomorphic in U if and only if it admits continuous 
partial derivatives with respect to the real and imaginary coordinates x;, y; in C” and 
satisfies the Cauchy-Riemann differential equations in U 


oo Leri ðv, i ðu ðv 
Zi > 2 Ox; OY; eg OY; Ox; 





0, 


We shall denote the ring of holomorphic function on U by ©"! (U). The sheaf defined by 
the rings O°! (U) defines a structure of a geometric space on C”. It is called the complex 
affine n-dimensional space. Clearly the field C can be identified with constant functions 
and all residue fields k(x) can be identified with C. 


Definition. A geometric space (X, ©) with Hausdorff X is called a complex manifold of 
dimension n if for each x € X there exists an open neighborhood U such that the geo- 
metric space (U, Oy) is isomorphic to an open ball in C” with the restricted geometric 
structure of the complex affine n-dimensional space C”. A complex manifold of dimen- 
sion 1 is called a Riemann surface. A morphism of complex manifolds (not necessary of 
the same dimension) is called a holomorphic map. 


A complex manifold is an example of a geometric space (X, Ox ) where the following 
additional property of Ox is satisfied: 
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(ii) Let U = UjesU; be an open covering. Suppose that a collection of functions 
fi € O(U;) satisfies 


TU; /UiNU; (fi) = TU; /[U;0U; (fi), Vi, j & I. 
Then there exists a unique f € O(U) such that, for any 7 € I, ryyu,(f) = fi- 


Example 2.1. Each non-empty open subset of C” with the restricted structure of the geo- 
metric space is a complex manifold of dimension n. A map f : U — V of an open subset 
of C™ to an open subset of C” is given by n functions f;(z) (defining the composition 
U + VC"), It is holomorphic if and only if each f;(z) is a holomorphic function on 
U. More generally, let f : X — Y be a holomorphic map of complex manifolds. Take an 
open neighborhood V of a point y € f(X) isomorphic to an open subset V’ of C” and let 
x € X be mapped to y. Then f~!(V) contains an open neighborhood U of x isomorphic 
to an open subset U’ of C™. The map f : U — V defines a map f’ : U’ —> V’ of open 
subsets of the corresponding complex affine spaces. Then f is holomorphic if and only if 
f’ is holomorphic (for all x € X). 


Example 2.2. Let X = C U {oo}. Define the topology on X by extending a base of the 
standard topology on C by adding open neighborhoods of oo of the form 


U,(oo) = {z EC: |z| > r}U {oo} 


Now extend the structure sheaf O°! on C by adding the rings O(U;-(00)), each equal to 
the ring of complex valued functions f(z) on U;,(0o) such that f(1/z) € O(U;/,(0)). We 
have X = Uo U U1, where Up = Up(oo) = X \ {0} and U1 = U(0) = X \ {oo} = C. 
The homeomorphism 7 : Up — U; defined by the formula z — 1/z is an isomorphism of 
the geometric spaces. In fact f is holomorphic on an open U C U; if and only if f(1/z) 
is holomorphic on t~!(U). Since Up is obviously isomorphic to C, we obtain that X 
is a geometric space. It is called the Riemann sphere or complex projective line and is 
denoted by CP!. Using the stereographic projection, we see that CP! is homeomorphic 
to a two-dimensional sphere. 


Remark 2.1. A more traditional way to define a structure of a complex manifold is by 
using local charts. A collection of {(Ua,¢a)} of open subsets Ua of X together with 
homeomorphisms a from Ua to an open subset of C” is called a local chart if X = UaUa 
and, if Uy N Ug # 0, the map ¢g 0 $4! : da(Ua N Ug) + 6g(Ua N Up) is holomorphic. 
Two local charts are called equivalent if their union is a local chart. A structure of a 
complex manifold of dimension n on X is an equivalence class of local charts. We leave 
it as an exercise to check that the two definitions are equivalent. 


Let G be a group which acts holomorphically on a complex manifold X. This means 
that for each g € G the map u(g) : x + g-x is holomorphic. It follows from the definition 
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of an action of a group on a set that j(g~) is the holomorphic inverse of u(g). Thus each 
u(g) is an automorphism of the complex manifold X. We would like to equip the set 
of orbits X/G of G with a structure of a complex manifold. We restrict ourselves with 
the case when G acts properly discontinuously on X. This means that for any compact 
subsets A, B of X the set {g € G: g(A)N B # O} is finite. In particular, for any x € X 
the stabilizer subgroup Gy = {g € G : g- x = x} is finite. 


Theorem 2.2. Let G be a group which acts holomorphically and properly discontinuously 
on a Riemann surface X. Then the orbit space X/G admits a structure of a Riemann 
surface such that the canonical map p : X — X/G is holomorphic. This structure is 
unique up to isomorphism. 


Proof. First we define the topology on X/G. This is standard. By definition a subset of 
X/G is open if its pre-image p~!(U) is an open subset of X. Now we define the structure 
sheaf. By definition 
Oxja(U) = Ox(p-"(U))° = 
{f € Ox(p"(U)) : f(g- £) = f(z), Yg € Ga € p`! (U)} 

It is immediately verified that this defines a structure of a geometric space on Y = X/G. 
Let us show that it is isomorphic to a Riemann surface. Let y = G - x be an orbit, 
considered as a point of Y. Since X is locally homeomorphic to R?, it is locally compact. 
Thus x contains an open neighborhood U whose closure U is compact. Let U = U; D 
Uz D ... be a sequence of strictly decreasing open neighborhoods of x with N,Un = 
{x}. Since each U is relatively compact and G acts properly discontinuously, the set 
G(n) = {g € G: Un N g(Un) F Ø} is finite. Clearly G(n) C G(m) for m < n. Thus 
there exists some N such that G(m) = G(N) for all m > N. I claim that G(N) C Gr. 
In fact, if this is false g- 2 = x’ # x for some g € G(N). The map g : X > X 
matches the filter of open neighborhoods U,, of x with the filter of open neighborhoods 
g(Un) of x’. Since our topology is separated, we can find an open subset U,, with large 
enough n such that g(U;,) N Un = Ø. However this contradicts the definition of G(V). 
So G(N) C Gz. Obviously, Gy C G(N). Thus G(N) = Gy, and in particular is finite. 
Therefore the set Ngec, g(Un) is an open neighborhood of x. It is invariant with respect 
to G,. Moreover, for any x’,7’ € Uy we have x” = g 2’ for some g € G implies 
g € Gz. In particular g(Un) N g/(Un) Æ 0 if and only if g, g' belong to the same coset 
of G modulo the subgroup Gg. Thus 


p(p(Un)) =UgeagUn) = [| sUn) 
gGrEG/H 


is the disjoint union of open subsets homeomorphic to Uy, and hence is open. This 
implies that V = p(Uy) is an open neighborhood of y = Gg in Y. Since each G- 
invariant function on p-!(V) is determined uniquely by its values on Uy we obtain 


Oy(V) = O(Un)& 
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If we replace V by a smaller open subset V’ and replace Uy with Uy = Un N ptv’) 
we similarly get 
Oy(V') = OUN) 


This shows that V is isomorphic, as a geometric space, to the orbit space Uy /G',. In fact 
the isomorphism is induced by the restriction of the morphism p : X — X/G of geomet- 
ric spaces to Uy. Its fibres are G,-orbits in Uy. Thus we have reduced our assertion to 
the case when the group G is finite and also fixes a point x € X. Now we have to use the 
assumption that X is of dimension 1. Without loss of generality we may assume that X 
is an open ball of finite radius r in C with center at the origin. For each g € G the map 
u(g) : X — X is given by a holomorphic function f(z) with f’(z) 4 0 at each point in 
X and f(0) = 0. An elementary theorem from the theory of functions in one complex 
variable says that f(z) = ze’, i.e. g defines a rotation of the ball. Since Gg is of finite or- 
der, we obtain that e’? = 1 for some d > 1. We also see that Gz is a cyclic group of order 
d. Now any function ¢(z) invariant with respect to the transformations z —> zn, n? = 1 
must be a holomorphic function in t = 2%. This easily follows by considering the Taylor 
expansion of (z) at 0. Now it is easy to see that the map z — z defines an isomorphism 
of geometric spaces U, (0)/G — U,.a(0). This proves the assertion. 














Remark 2.2. It follows from the proof that the assertion of the theorem remains true in any 
dimension if we additionally assume that G acts freely on X, i.e., the stabilizer subgroup 
Gx of any point x € X is trivial. In general case X/G is not a complex manifold but an 
analytic space with quotient singularities (also called a complex orbitfold). 


Corollary 2.1. Let us identify R? with C in the natural way. Then E = R?/A admits a 
structure of a compact complex manifold of dimension I for which the factor map C > E 
is a holomorphic map of complex manifolds. 


Proof. The group A acts on the complex manifold C by translations z > z + A,A € 
A. This action is obviously properly discontinuous. In fact any compact set B in C is 
contained in a finite union of A-translates of the fundamental paralellogram 


I= {z €C: z= aw + bux, 0 < a,b < 1}, 


where w1, w3 is a basis of A. Thus for any compact set A, we have (mw + mow, + A)N 
B = 0 if |m4|,|mo2| are sufficiently large. This leaves us only with finitely many A such 
that (A+ A)N BO. 














Definition. A Riemann surface X is called a complex torus of dimension 1 or an elliptic 
curve if it is isomorphic to C/A for some lattice A. 


Theorem 2.3. Two elliptic curves C/A and C/A’ are isomorphic if and only if ' = aA 
for some a € C \ {0}. 
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Proof. We shall use the simple observation that the geometric spaces C and E = C/A are 
locally isomorphic. This means that for any point z € C has a neighborhood isomorphic 
to an open neighborhood of z + A € E. This follows immediately from the proof of 
Theorem 2.2. Assume A’ = aA for some non-zero complex number a. Consider the map 
C — C defined by the formula z — az. It is an automorphism of the complex manifold 
C which maps A onto A’. It induces a bijective map of the orbit spaces C/A — C/A’. It 
follows from the previous remark that this map is holomorphic. 

Conversely, assume that there is a holomorphic isomorphism f : E = C/A > E’ = 
C/A’. Let f(0 + A) = zo + A’. Consider the map tz : E > E’ defined by the formula 
z+A — (z+29)+A". Itis easy to see that it is a holomorphic automorphism. Composing 
f with t_,, = tz! we may assume that f(0 + A) = 0+ A’. Now we use that the 
projection maps p : C — C/A and p’ : C — C/A’ are universal covers of the topological 
spaces. The composition C + C/A — C/A’ is a continuous map of a simply-connected 
topological space C to the torus C/A’. It has a unique lift to a homeomorphism f:C>C 
of the universal covers. Itis also a holomorphic map satisfying F0 (0) € A’. In fact, the 
composition p’ o f is equal to f o p and hence is holomorphic. This easily implies that f 
is holomorphic. Now for any \ € A and z € C we have f(z + a= f(z) € A’. Thus the 
continuous map z + f(z+A)— f(z) € A’ is constant and hence f(z+A) = f(z) + f(A). 
This shows that the partial derivatives of f are periodic with respect to A. By Liouville’s 
theorem, they must be constant. Hence f isa linear map of C which maps A to A’. 














Corollary 2.2. There exists a natural bijection between the set of isomorphism classes of 
elliptic curves and the modular figure D. 


The group law on C defines a group law of the quotient group C/A. It follows from the 
previous theorem that any holomorphic isomorphism of elliptic curves which sends 0 to 
0 is a homomorphism of groups. The group of holomorphic group automorphisms of the 
elliptic curve C/A is isomorphic to the group {a € C* : aA = A}. Let w1, w2 be a basis 
of A. Replacing A with zA for some z € C* we may assume that w1 = 1,w2 = w E€ H. 
Then 

aw=aw+PB, a-l= yw, 
p 
ð 
(w, 1) € C? is a complex eigenvector of M with eigenvalue a. The eigenvalue a = x + iy 
satifies the characteristic equation 


for some integral invertible (over Z) matrix M = (° ) . This shows that the vector 


— (a + ô)t + det M = 0. 


We have a +ā = 2x = —(a + ô) € Zand |a| = x? + y? = det M = 1. The only 


solutions are i 


(x,y) = (0, 1), ( 1,0), ( 2 3/2). 
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This gives 
e2t/3 ; e4T/3 , 

















a = +i, +1, 





Thus there are the following possibilities for the group G of holomorphic group automor- 
phisms of elliptic curve: 
G S Z/2,Z/4,Z/6. 


The first case is realized for any lattice A. The second case is realized by the lattice Z+ Zi. 
The third case is realized by the lattice Z + Tere. 

Let us show that any elliptic curve with G # {+} is isomorphic to either E; = 
C/Z + Zi or Ep = C/Z + Ze2™'/3, By Corollary 2.3, we may assume that w belongs 
to the modular figure. Thus |Re w| < 1/2 and |w| > 1. We already noticed in Lecture 
1 that the derivative of the Moebius transformation z —> sae at the point zp is equal to 
(czo + d)~?. Since af = 1 for some d > 0, the matrix M is of finite order. This implies 
that the derivative of the corresponding Moebius transformation is a complex root of 1. In 
particular, we have |yw + ô| = 1. This implies 











aw+ | |(aw + B)(yw + 6)| 
yw+d| [yw + ô|? 





|u| = 





= |(aw + 8)(q@ + ô)|. 





Since |w| > 1,and ad — Gy = 1 this gives |aw + 8|, |yw + ô| > 1. Thus 
ju] > [aw + 8| > allel, [ol > ho + ôl > lllo. 


Assume a # 0. Then we must have |a| = 1,8 = 0,|w| = 1. Assume y # 0. Then 
we must have |y| = 1,6 = 0, |w| = 1. Thus we have the following possibilities for the 


matrix M: 
Ap /10\,_,/0 1\,/0 1 pede UY 1 
TNO De Na pe Sh et, ye 


This gives the following possibilities for w: 


1 0 EE Avs 
waiM= (5 dan 9) OBI 


_ 2mi/3 =, /10\,/0 1\,_/0 1 ay 
w=e mM=+( Nee ila a ,G=Z/6. 


This proves the assertion. 
































Moreover we have shown that the group PSL(2, Z) = SL(2, Z)/+1 acts on the upper 
half-plane H freely except at the orbits of the points w = 1, e?™i/3, The stabilizer group 
PSL(2,Z); = Z/2,PSL(2, Z) 2-1/3 = Z/3. The elliptic curves corresponding to these 
two exceptional orbits are called harmonic (resp. anharmonic). 
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Exercises 


2.1 Let X be the set of prime numbers in Z together with 0. Define a topology on X 
by declaring that sets of the form V (n) = {p € X : p|n},n € Z are closed. For each 
open set D(n) = X \ V(n) take O(D(n)) to be the ring of rational numbers whose 
denominators are products of powers of prime divisors of n. Show that this defines a 
geometric structure on X. Show that x(x) = Fp, the prime field of p elements, if £ = p 
is prime and the field of rational numbers Q otherwise. Show that for any f = a/b € 
O(D(n)) the value of f at x is equal to itself if z = 0 and is equal to pore if x = pis 
prime. 





2.2 Using the notion of a geometric structure give a definition of a differentiable manifold 
of class C*. 

2.3 Show that the projective space P"(C) (defined as the set of one-dimensional linear 
subspaces in C”*") has a structure of a complex manifold of dimension n. Show that the 
natural map C”*! \ {0} — P” (C) defined by sending z = (20, . . - , Zn) to the line Cz is 
a holomorphic map. 

2.4. Let (X, Ox) be a geometric space. Assume that the value of f € O(U) at a point 
x € U is not equal to zero. Prove that the restriction of f to some open neighborhood V 
of x is an invertible element of O(V). 

2.5 Prove that any holomorphic function f : X — C defined on a connected compact 
Riemann surface must be a constant function. 

2.6 Let A be a lattice with complex multiplication (see Exercise 1.4). Show that the ring 
Ox is isomorphic to the ring of holomorphic group endomorphisms of the elliptic curve 
C/A. 

2.7 Let A be a cyclic subgroup of the multiplicative group C* of the field C generated by 
a complex number q with |q| 4 1. Show that the factor group C*/A has a structure of a 
complex manifold of dimension 1 isomorphic to an elliptic curve. 

2.8 Generalize the construction of an elliptic curve by showing that a quotient group 
C” modulo the subgroup A generated by 2n vectors linearly independent over R has a 
structure of a compact complex manifold of dimension n. It is called a complex torus of 
dimension n. 





2.9 Consider the action of the group G = {+1} on C? defined by sending (21, z2) to 
(—z1,—22). Show that C?/G does not admit a structure of a complex manifold such 
that the canonical map C? —> C?/G is holomorphic. However C? \ {0}/G is a complex 
manifold of dimension 2. 


2.10 Let P(z1,...,2n) : C” — C be a complex polynomial in n variables. Assume 
SF (ay, ..+;Qn) Æ 0, where P(a1,...,an) = 0. Show that there exists an open neigh- 


borhood U of the point (a1,..., an) such that U N P~'(0) is a complex manifold of 
dimension n — 1. Generalize this to the case of a polynomial map C” — C*. 
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2.11 Let P(z0,..., Zn) : C” — C be a complex homogeneous polynomial in n + 1 
variables. Assume that the equations ae =0, t=0,...,n have no common solutions 


in C”+! \ {0}. Show that the set of zeroes of P, considered as a subset of projective space 
P” (C) is a complex manifold of dimension n — 1. Generalize this to the case of the set of 
zeroes in P”(C) of a finite set of homogeneous polynomials. 
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Lecture 3 


Theta Functions 


3.1 It is known that a compact smooth manifold of dimension n can be always embed- 
ded in R2”*!, This theorem does not have its analog in the complex case. A compact 
complex manifold cannot be embedded in C™ for any N. This follows from the fact that 
any holomorphic function on a connected compact complex manifold must be a constant 
function. However, it is often possible to embed a complex manifold into projective space 
P” (C). A theorem of Chow says that in this case the complex manifold is isomorphic to 
a projective algebraic complex manifold. The latter is defined as the set of solutions in 
n (C) of a system of homogeneous algebraic equations 














Jip ge) = +o SN sg) SO (3.1) 


This system must satisfy the following smoothness conditions: 


(i) the polynomials f;,... 
Clt Tn]; 


, fn generate a prime ideal Ix in the ring of polynomials 


(ii) the rank r of the matrix 


oft Of, 
Oxo Orn 
J= : : (a0, ...,an) (3.2) 
ofn fy 
Oxo O@n 
does not depend on the point (ao, . . . , an ) satisfying the equations (3.1). 


The number d = n — r is equal to the dimension of the complex manifold defined by (3.1) 
(see Exercise (2.11)). Not every complex manifold X can be given in this way. A neces- 
sary (but not sufficient) condition is that the field M(X) of meromorphic functions on X 
has the transcendence degree over C equal to the dimension of X. A meromorphic func- 
tion is defined by choosing a covering of X by open connected subsets U; and assigning 
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to each U; an element f; of the field M(U;) of quotients of O(U)'® with the compatibil- 
ity condition f; = fj in M(U; Uj). Here we use the fact that O(U;)'® does not have 
zero divisors. The transcendence degree of the field M(X) over C is always less or equal 
to the dimension of X (see [Shafarevich], vol. 2, Chapter 8, §2). If X is a projective 
algebraic complex manifold, then its field of meromorphic functions coincides with the 
field of rational functions. A rational function is an element of the field R(X) generated 
by fractions ae tion} formed by homogeneous polynomials of the same degree consid- 
ered modulo the ideal J. The transcendence degree of this field is always equal to n — r. 
Dropping the condition (ii), we obtain the definition of an irreducible complex projective 
algebraic variety. Its dimension is equal to n — r, where r is the maximal value of the 
rank of the Jacobian matrix. 

We shall prove later that any compact complex manifold of dimension 1 is isomorphic 
to a projective algebraic complex manifold (a smooth projective curve). In this lecture we 
shall find such an isomorphism explicitly for complex tori X = C/A. Let us try to 
find a non-constant map f : X — P"(C). Recall that the complex projective space 
P” (C) is defined as the set of lines in C”+t, or equivalently as the set of non-zero vectors 
(z0,-..,2n) € C"*! considered up to multiplicatication by a non-zero scalar. The set 
P”"(C) is a complex manifold of dimension n. It is covered by n + 1 subsets U; = 
{(Z0,---,2n) : zi # 0} each isomorphic to C” (see Exercise 2.3). A holomorphic map 
f: X — CP”, after composing with the natural map C — C/A, is defined by n + 1 
holomorphic functions fo,..., fn on C which need not be periodic with respect to A but 
must satisfy the weaker property: 


filz +A) =er(z)fi(z), 1=0,...,.n, AEA, 
where e) (z) is a holomorphic invertible function on C. Let us try to find such functions. 


Definition. A holomorphic function f(z) on C is called a theta function with respect to a 
lattice A if, for any À € A, there exists an invertible holomorphic function e) (z) such that 


fle+A)=erl2) f(z), vee. 
The set of functions {e(z)} is called the theta factor of f. 


Example 3.1. Let A = Z + Zr, where r € H. We know that each lattice can be reduced 
to this form by means of a homothety transformation. Set 


Q(z; T) = a eit (n?r+2zn) 
neZ 


This function is holomorphic on C. In fact, we shall show that the series converges uni- 
formly on any bounded set in C. Then we can differentiate the series and see that the 
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derivative with respect to Z is zero. Thus the series represents a holomorphic function on 
C. Assume that |Im z| < c on a bounded set. Then 


X l [eiT (n?T+22n) | < X e77n°Im (7) e2men 


neZ nEZ 


Choose N such that e77 ™ (e4™ne < 1 for n > N. Then 
> enor im (T) e2men < `> e77n°Im (7)/2, 
n>N n>N 


The latter series is obviously converges. 
Now let us check that O(z; T) is a theta function. Obviously it is periodic with respect 
toz > z+m,m € Z. We also have 


O(z+7;T) = 5 eiT(n?T+2zn+2rn) = 5 eit ((nt1)?7+22(n+1)—7—2z) = 
neZ neZ 


eit(—T-2z) eit ((n+1)?7+22(n+1)) — e T7422) Q(z, T). 
nEZ 
Proceeding by induction we see, for any A = m + nr € A, we have 


O(z+m-+nt;T) = ea Ti(n?r+2n2) 9 (z; T), 
so that O(z; 7) is a theta function with the theta factor 


E€m+nr (z) = Win (nr +2nz) . 


This theta function is called the Riemann theta function. 


3.2 How to find a general form of a theta function? First notice that the theta factor 
satisfies the following condition: 


evi (z) = ey(z + A')Jeyx (2). (3.3) 
This follows from comparing the equalities: 
Fiz +A +A) = eng (2) F(Z), 


JEHAN) = exl +N) f+ N) = ealz + N)ex(2)f 2). 


Let (z) € O(C)* be a holomorphic invertible function on C. For any theta function 
f(z) with theta factor e) (z) the function f(z)(z) is also a theta function with the theta 
factor 


ealz)! = en(z)(z + A)P(z) 1. (3.4) 
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Definition. A set of holomorphic invertible functions {e)},e, satisfying the functional 
equation (3.3) is called a theta factor with respect to the lattice A. Two theta factors 
{exa}aca and {e,}\-, are called equivalent if they are either related by (3.4) for some 
invertible holomorphic function ¢(z) or obtained from each other by translation of the 
argument z > z+ a. 


Let Th({e,}; A) denote the set of theta functions with theta factor {e,}. Obviously 
it is a subspace of the space O(C) of holomorphic functions on C. Notice that for any 
f,g € Th({e,}; A) the meromorphic function f/g is periodic with respect to A. So, it 
defines a meromorphic function on C/A. Such functions are called elliptic functions. 

We have 

Th({e,}; A) = Th({e\}s A) 


if {e,} is equivalent to {e/ }. The isomorphism is defined by composition of multipli- 
cation with a function defined by (3.4) and the inverse image under the translation map 
Z>zZ+4. 


One can show, although we don’t really need it, that it is possible to find ¢(z) such 
that log(e,(z)¢(z + A)b(z)~!) depends linearly on z. Thus the theta factor e, (z) looks 
like 

elz)! = eT 2rilaaz+ba). 


Further replacing f(z) by f(z)e?™( (pa12*+(1—F)2) we may assume that a; = bı = 0. In 
particular 


f(z+D=f(2), fle+7) =e Pet) Ha). (3.5) 


Now we have 
fe+r +l) = f((2 +7) +1) = f(z +r) =e PMO f(a) 


f(z +r + 1) — E i 1) a T) = g 2Tilalz+1)+) F(a) = g rid om are) Eh, 


By comparison, we see that e~?7"* = 1, hence a = k for some k € Z. The first equality 


allows us to expand f(z) in a Fourier series 
= Qrinz 
= 5 CnE€ » GEC. 
neZ 
Replacing z with z + 7, we obtain 


(z $ T) =) Ch E2TinT o 2rinz _ ge Tener?) ls) = 
neZ 


X ion e 27d Qri(n—k)z =) Cntke 2rib e2minz | 


nEZ nEZ 
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2rinz 


Comparing the coefficients at e we get 


ae = cpe? NTH), (3.6) 


If k = 0 we must have c,, = 0 except maybe for one value N of n satisfying NT +b € Z. 
This gives 
f(z) = exert, 

If k # 0 we get a recursion for the coefficients. Assume k < 0. Let cy Æ 0 for 
some N > 0. Then |cy-p| = |ene~27(N7+4)|. Since Im 7 > 0, the absolute value 
of the coefficients Cy—sk, s > 1, will not go to zero and the Fourier series will diverge. 
Similarly, if cy 4 0 for some N < 0, we get |CN+sk|, s > 1, do not go to zero and again 
the series diverges. It remains to consider the case k > 0. In this case all coefficients are 


determined by k coefficients co, .. . ,Ck—1. In fact, we can solve the recurency explicitly. 
To simplify the computations, let us replace f(z) with f(z + 5 — 8), Then 
T b T b 
mei | a er 
fet EZ t 1) se MRED F244 TD) = 3.7) 


So we may assume that 
b = kr 2, 


Let s € {0,..., k — 1}. Then it is easy to check that 


Cirk Z eTil(s+rk)?r/k] Cs (3.8) 


is the explicit solution of the recurrency 3.6. This shows that each f(z) with the theta 
factor 


. a) 
em+nr(2) = eg erg) (3.9) 
can be written in the form i 
=l 
f(z) = y. CsOs(z; T)k, 
s=0 


where 
Os(2;T)k = y eTills+rk)?rT/k] 2riz(strk) s=0,...,k—1. 
rEZ 
It is convenient to rewrite these functions in the form 
Os(ziT)k = y eril(g +r) kr+2kz(ġ+r)] 
rEZ 
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It is easy to see using the uniqueness of Fourier coefficients for a holomorphic function 
that the functions ©,(z), are linearly independent and hence form a basis of the space of 
theta functions with the theta factor (3.9). 


3.3 Summarizing the previous computations, we obtain 


Theorem 3.1. Each theta factor is equivalent to a theta factor of the form 


2 
s n 
em+n7(2) = eT 2rilnkz+ "z kT) 


The space Th(k; A,) of theta functions with theta factor of this form is zero for k < 0. 
For k = 0 it consists of constant functions. For k > 0 it is of dimension k and is spanned 
by the functions 


Os(z; r)a = Dieter AEH, 5 =0,...,k- 1. 
rEZ 


Observe that 


2 2 2 
e` 2rilnkz+ kr) ~~ 2mi(mk! 2+ %k'7) Se 2ri(n(k+k')z+ %5 (k+ ki)r) 





Obviously, if f € Th({e,};A),g € Th({e\}; A) then fg € V({e,e\}, A). This implies 
that the multiplication of functions defines a bilinear map 


Th(k; A+) x Th(k’; A;) > Th(k + k’; A,). 
Notice that for k = 1 we obtain 
Oo(z; T) = O(z;7). 


Let us modify a little the definition of O (z; 7) introducing the theta functions with rational 
characteristics 


I(27)= > eTillatr)’r+2(z+b)(a+)] a,b EQ. 
rEZ 


In this notation 
Os(2;T)k = Vso(kz; kT) (3.10) 
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The functions 











. E ag 2 
(elr) =Vri(eit) =i DY (-1) 20g? (3.11) 
rést+Z 
r2 
b2(z|r) = violz; T) = ay v'q?, (3.12) 
res+Z 
8 
63(z|T) = Voo(z;T) = 5 vq? , (3.13) 
neZ 
n2 
O4(2lT) = Vor(zi7) = X (Da7, (3.14) 
neZ 
where v = €27'*, q = e?""7, are called the Jacobi theta functions. It is easy to check the 
following properties of functions a(z; T): 
Valz; T) = TOO Oy (2:7) ifa -a,b -bEz (3.15) 
Bap(Z +1; T) = 5y eilla Hr)?r+2(z+b)(a+r)+2(a+r)] = e? p(z; T); (3.16) 
rEZ 
Vabl z + T; T) h S eril(a t rer - 2(z + b)(a t r) f 2r(a t r)| = (3.17) 
rEZ 
yee tr+1)?r—r-+2(z+b)(a+r+1)—22—2b] _— o anib ein(-T-22) 9 p(z; 7). (3.18) 


rEZ 


Also each Vab(z; T) is obtained from ©(z;T) by translation in the argument z and 
multiplying by a nowhere vanishing factor. 


O(z +b+arT;T) = 9 e"ilr?’r+2(z+b+ar)r] — 
rEZ 





`> eTilla+r)?’T+2(2 Hb) (r+a)—a?r—2(z+b)a] = eTil? T+2(z+b)a) 9 


rEZ 


lT]: 


Or, equivalently, 
Valz T) = eit (a?r +2(z2+b)a) 900 (z +b+arT;T) (3.19) 


Let us set 


Th(k; A-)ab = {f € O(C): f(z +m + nr) = e7? manb) —2kni(nzt y D Fl}. 
(3.20) 
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Its elements are called theta functions of order k with rational theta characteristics (a, b). 
It is easy to see that the multiplication of functions defines a bilinear map 


Th(k; A; ab x Th(k’; Araw —> Th(k + Re Ar) (ata) (btb): 
For any f € Th(k; A) we have 


eTila?’r+2(z+a)b] f(y + 2+ ar) E Th(k; ab: 


In particular, there is a canonical isomorphism 
Th(k; A-)ab S Th(k; AS) 
Also observe that 


Th(k; A-)ab = Th(k; A Jay ifa -a,b —bEZ. (3.21) 


3.4 Now we are ready to use theta functions to embed E = C/A in projective space. 


Lemma 3.1. Let f be a nonzero function from Th(k; A). Then f has exactly k zeroes in 
C modulo A counting with multiplicities. 


Proof. We use a well-known formula from the theory of functions in one complex vari- 
able: the number of zeroes (counted with multiplicities) of a holomorphic function f(z) 
on an open subset D of C inside of a compact set K contained in D together with its 
oriented boundary T is equal to 


1 
Z= 5 [aloe teas 


Here we also assume that f(z) has no zeroes on I’. Let us take for K a small translate 
zo + II of the fundamental parallelogram of the lattice A such that its boundary I does 
not contain zeros of Vab(z; T). It is easy to achieve since a holomorphic function has a 
discrete set of zeroes. Using that 


f(z+m+nrT) = eTTikln?r+2nz) F(z) 
we obtain 


1 1 zo+1 
Z= za | doe re = =f dlog f(z) — dlog f((z + 7;7)— 


20 
1 ZQO+T zo+1 
— | =al j= ik(2 =k. 
mj Mosse) — dog set ir)= ggf advik(a2 +7) 


Since each zero of Jap(z; T) can be translated to a zero of Vay(z; T) inside K by means of 
a vector from the lattice, we obtain that the zeroes of J,,(z;7) form k orbits with respect 
to A. This proves the assertion. 
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Corollary 3.1. The zeroes of the function a(z; T) in C are the points 


1 1 
z=(ats)r+(o+ 5) +A. 


Proof. Using the formula (3.19) itis enough to verify that the function 71 1 (z; T) vanishes 
22 
at the origin 0. This will follow from the fact that this function is odd. We have 


Val —z; T) = > eTillatr)?’r+2(—2+0)(a+r)] — 
rEZ 


D aaa = Va olz; T). (3.22) 
rEZ 











Taking (a,b) = (5, 5) and using (3.15) we obtain what we want. 





Corollary 3.2. The set of zeroes of Os(z;T)x consists of the points 


s 1 lj 
G + ait + 
Proof. Use (3.10) and Lemma 3.1. If z is a zero of O,(z;7T), then kz is the zero of 
V so(z; kr). Thus 








gil 1 
kz=(-+- = +Z + Zkr. 
z C 5)kr+5 + + Zkr 
This gives 
s 1 1 1 
z=(7 t35) +g t Zt Z. 














Theorem 3.2. For each k > 1 the formula 
z > (Oo(z;T)k, -< -, Ok-1(2;T)k) 


defines a holomorphic map $p : E, = C/(Z + Zr) => CP*Ł. If k > 3, this map is a 
holomorphic embedding (i.e. injective and the derivative at each point is nonzero). 


Proof. First of all the map is well-defined. In fact all theta functions ©s(z; 7), correspond 
to the same theta factor, hence when we replace z with z+ A, A € A, the right-hand 
side is multiplied by a non-zero scalar and hence defines the same point in the projective 
space. Also we see from the previous corollary that the functions ©,(z; 7), do not vanish 
at the same point, hence not all coordinates of the vector (z) are zero. The map is 
holomorphic since the theta functions are holomorphic functions. Let us show that it is 
injective when k > 3. Suppose (z1) = O¢(z4), or døg(z1) = 0. Using the formulae 
(3.10) and (3.20), we see that, for any integers m, n, 
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O,(z + i + TT) h = Vao(kz +m + n7; kr) = 
eee gnia ake). o(kz;kr) =e 8 ARA aira. (3.23) 


This shows that (21+ + 5?) = b(2,+ 2+ 5), or dgoy(z1) = dbg(zit $+) =0. 
Note that, if k > 2 we can always choose m and n such that the four points 21, 24, 22 = 
a+ P+, 2 = 2+ 7 +% are distinct. The linear space generated by the functions 
©O,(z;7)x is of dimension k. So, if k > 3, we can find a nontrivial linear combination f 
of these functions such that it vanishes at z1, z2 and some other k — 3 points z3,..., Zķ—1 
which are distinct modulo A. But then f also vanishes at 2} and 25, or f has a double zero 
at zı and z2. Thus we have k+ 1 zeroes of f counting with multiplicities. This contradicts 
Lemma 3.1 and proves the assertion. 














Remark 3.1. Let us consider the group EA /A. If we consider it as a subgroup of C/A we 
see that 


TMA = {a E€ C/A : ka = 0} 


is the subgroup kE of k-torsion points on the elliptic curve E = C/A. The group E acts 
by translations on E and on the space of functions V; generated by O,(z;7),. In fact, we 
have 


1 mis 
O.(2+ Rite Sei ©Os(z;T)k; 


as we have already noticed in the proof of Theorem 3.2. Also 








Os(2 +>; 7)k = Vsolkz +T; kT) = Yer ts)? Etat Ear +s)) — 


k 
i Lety]\2r4 Lot SS i Liat (3.24) 
5 eit (kr tst+1)?7+22(kr+s+1)—2z—F)] — ë mi(2z-4 z)Os41(2; T)k, 


rEZ 





where O (z; T) = O0(2;T)k- 
Example 3.2. Let us take k = 3 and find the image of the map 
$3: E, > CP’. 


Consider the action of the group G = $A /A on CP? by projective transformations defined 
on generators by the formula: 


(1/3) « (£0, 21,22) = (ao, 7/344, e7 t2); 


(3.25) 
(T/3) * (£0, £1, £2) = (£1, £2, £0). 
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Then it follows from the previous remark that the map ¢3 is G-equivariant if we make 
G act on E by translations. This implies that the image of E, must be invariant with 
respect to the action of G as above. It follows from the remark made after the statement 
of Theorem 3.1 that for any homogeneous polynomial F (To, T;, T2) of degree 3 the theta 
function F(Oo(z; T)3, 01(z;7)3, O2(z;7)3) belongs to the space Th(9; A) of dimension 
9. On the other hand the space of cubic homogeneous polynomials in three variables is of 
dimension 10. This implies that there exists a cubic polynomial F such that 


F(Oo(z; T)3, O1(2; T)3, O2(2;7)3) = 0, 


so that the image C of ¢3 is contained in the set of zeroes of the homogeneous poly- 
nomial F (zo, £1, £2) in CP. As we already noticed any compact closed subvariety of 
P” (C) must be the set of zeroes of a system of homogeneous equations. Some elementary 
algebraic geometry (or better commutative algebra) tells us that C is the set of zeroes of 
one polynomial. The degree of this polynomial cannot be less than 3. In fact any polyno- 
mial of degree 1 defines a a complex manifold isomorphic to P'(C) hence homeomorphic 
to a two-dimensional sphere. But C is homeomorphic to a torus. Similarly a polyno- 
mial of degree 2 defining a complex manifold can be reduced by a linear homogeneous 
transformation to the form Ta + xız2. Hence it defines a complex manifold isomorphic 
to P!(C) (use the projection map (xo, £1, £2) — (xo, £1)). So we see that C is the set 
of zeroes of F. The polynomial F must be a common eigenvector for the action of the 
group 1A /A = (Z/3)? on the space W of homogeneous cubic polynomials given by the 
formula (3.25). Also it satisfies the condition that its partial derivatives have no common 
zeroes. It is easy to see that this is possible only if F = r? + x3 + £3 +-yx0r122 for some 
scalar y. This implies that the image of #3 is the plane projective curve 


xe + r? + r3 + yx0%1%2 = 0. (3.26) 


Since E, is a compact complex manifold of dimension 1, it is easy to see that it must 
be equal to the whole curve. Also since it is a manifold the partial derivatives of the 
polynomial in (3.26) do not have a common solutions in P? (C) (see Exercise 3.2). This 
easily implies that 

y3 £ —27. 


The equation (3.26) is called the Hesse equation of an elliptic curve. So we have proved 
that any elliptic curve is isomorphic to a complex submanifold of the complex projective 
plane given by the Hesse equation. 


Remark 3.2. Consider the affine part of the Hesse cubic where xo Æ 0. It is isomorphic 
to the curve C’ in C? given by the equation 


1+a°+y?+ yxy =0. (3.27) 
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It follows that the functions 


define a surjective holomorphic map C? \ Z — C’ whose fibres are equal to the cosets 
z + Z + TZ. Here Z is the set of zeroes of Og(z;7)3. Observe that the functions ®(z) 
and ®5(z) are elliptic functions with respect to A, i.e. meromorphic functions with the set 
of periods A. In other words we have succeeded in parametrizing the cubic curve (3.27) 
by double-periodic functions. For comparison let us consider a homogeneous equation 
of degree 2. Applying a homogeneous linear transformation we can reduce it to the form 
x2 — x? + x3 = 0 (if it defines a complex submanifold). Dehomogenizing, we get the 
equation of a (complex) circle 
S: r? +y =l. 

In this case its parametrization C — S is defined by one-periodic holomorphic functions 
cos 27z, sin 27z . Its fibres are cosets z+ Z. One of the best achievements of mathematics 
of the last century is the Uniformization Theorem of Klein-Poincaré which says that any 
equation f(x,y) = 0 defining a Riemann surface in C? admits a parametrization by 
automorphic functions. Its group of periods is not commutative in general. 


Exercises 


3.1 Using Exercise 2.12 show that the equation z3 +y?+ z°+yayz = 0 defines a complex 
manifold of dimension 1 in P?(C) if and only if y? + 27 4 0. 
3.2 Show that the image of a 3-torsion point of C/A under the map ¢3 is an inflection 
point of the Hesse cubic (a unique point at which some line intersects the curve with 
multiplicity 3). Find the projective coordinates of these points. 


3.3 Show that for general value of the parameter ~y the group of projective automorphisms 

of the Hesse cubic is of order 18. Show that it is generated by translations z+a, a € 3A /A 

and the inversion automorphism z — —z of the corresponding complex torus C/A. Find 

the corresponding projective automorphisms of the Hesse cubic. 

3.4 Show that the image of 2-torsion points on the Hesse cubic are the four points (0, 1, —1), (1, a, a), 
where a is a root of the cubic equation 2t? + yt? + 1 = 0. 

3.5 Find the values of the parameter y in the Hesse equation corresponding to the har- 

monic and anharmonic elliptic curve. 

3.6 Show that the parameter y in the Hesse equation (3.26) is equal to the following 

function in 7: 


Voo (0; 37)? + g!/?0o0(7; 3T)? + G?000(27; 37)? 
q?/®9q9(0; 37) Yo0(T; 87) Vo0(2T; 37) 
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3.7 Analyze the proof of Theorem 3.2 in the case k = 2. Show that ¢2 defines a holo- 
morphic map E, — P!(C) such that for all points x € P!(C) except four, the pre-image 
consists of 2 points and over the four points the pre-image consists of one point. 


3.8 Show that the map C — P*(C) given by the formulas 


z => (dolz), 9492), doz (2), P12 (2)) 


1 
2 
defines an isomorphism from C/2A & C/A onto a complex submanifold of P4(C) given 
by two homogeneous polynomials of degree 2. 

3.9 Let X be a complex manifold of dimension 1 in P*(C) defined by two homogeneous 


equations of degree 2. 


(i) Show that, after linear homogeneous change of variables X can be defined by the 
equations z2 + az? — z3 = 0, 22 + bz? — 22 = 0 for some nonzero a,b € C. 


(ii) Show that each X as above is isomorphic to an elliptic curve. 


(iii) Find the values of (a, b) in (i) such that the corresponding elliptic curve is harmonic 
(resp. anharmonic). 


3.10 Show that each 7,,(z;7) considered as a function of two variables z, rT satisfies the 
differential equation (the Heat equation): 














0” f (2,7) Of (2,7) 
Ani = 
Oe" Or 
3.11 Check the following equalities: 
1 TIT T+1 RiT T. 
Doo(0; T) = Vox (537) = -67 (— 7) M919(537): 
81(0;7) = 9 te y= mir /49 (fe ) mir /4,9 (= ); 
04 T) = Volgi T ve 4 2 ST 44 gtk 
1 TIT T+1 TIT T 
B19(057) = -b1 (557) = Dg (57) = eol) 


3.12 Prove that, for any w € C, the product Jap (z+w;7)Vay (z—w;T) is a theta function 
of order 2 with theta characteristic (a+a’, b+b’). Deduce from this the addition formulae: 


voi (0)Io1 (z + w)8q1(z — w) = Voi (2)? V1 (w)? — vaa (z)"V11(w)?, 
919(0)7019(z + w)Dr9(2 —w)= 91 9(z)?919(w)? — va (2°01 (w), 
voa (0) 9a 1 (2 + w)P11(z = w) = V1 1 (2) 001 (w)? — Va (2)79 1 (wW). 
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Lecture 4 


Theta Constants 


4.1 In this lecture we shall study the functions of 7 equal to (0, T) where J(z,T) is a 
theta function. To show that they are worth of studying we shall start with the Riemann 
theta function O(z;7). We have 


0(0,7) = 5 emir — `. q”, q =e". (4.1) 


rEZ rEZ 
We have 
2 2 = 
@(0,r)* = 5 a as Gri tatte = So alng, 
r1EZ TREZ n=0 
where 


en(n) = #{(r1,...,7e) EZE in ar? +... 473}. 


So ©(0,7)* is the generating function for counting the number of representations of an 
integer as a sum of k squares. For example c3(6) = 24 since all representations of 6 as a 
sum of 3 squares are obtained from 6 = 2? + 1 + 1 by changing the order and signs. 

Let us show that 0(r) = @(0,7)* satisfies the following functional equation: 


O(—1/r) = (—ir)*/?0(r), Vr +2) = V(r). (4.2) 


Here in the first equation we take the branch of the square root which is positive on the 
purely imaginary ray iR>o. The second equation follows immediately from the Fourier 
expansion. To prove the first one we use the Poisson formula in the theory of Fourier 
transforms. Recall that for any rapidly decreasing at infinity smooth function f on R” one 
defines its Fourier transform f by the formula: 


fœ) = f EEA Sejak, 
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Let A be a lattice in R” and A be the volume of its fundamental parallelopiped. Let 
A’ = {x € R": x-v © Z,VYv € A}. Then the Poisson formula says that 


5S JOA fo: (4.3) 


xEA yE’ 


We apply this formula to our case. Take n = 1 and A = Z and f(x) = e772? considered 
as a function of x € R. Then the left-hand side of (4.3) is equal to O(0, iy). Now the 
Fourier transform of f(x) is easy to compute. We have 


f(x) = B eritem udi = 1. eT) ema? /u gy = 


=09 


eh | en Tut dt — eld Jj = f(-1/y)/Vy- 


This verifies (4.2) when we restrict 7 to the imaginary axis T = zy. Since the set of zeroes 
of a holomorphic function is discrete this suffices. 
Note that if k = 8n, (4.2) gives 
—1 
8(—) = 79. (4.4) 
T 
We shall interpet this later by saying that 0(0;7)* is a modular form of weight k/2 with 
respect to the principal congruence subgroup T (2). 
To give you an idea why the functional equation of type (4.2) is useful, let me give 
one numerical application. Take 7 = ix purely imaginary with x > 0. Then (4.2) gives 


eet _ 1 rr? je _ 1 > —ar?/a 
Sve eee ee ) 


rEZ rEZ 


Suppose we want to compute the value of the left-hand side at small x. For x = .001 we 
need fifty terms to reach the accuracy of order 1071. But now, if we use the right-hand 
side we have 

Xo eT = 10(1 + 2e7 1" +), 

rEZ 


—1007 | 107434 


Since e we need only two terms to reach the accuracy of order 107400, 


4.2 We know that the zeroes z of O(z, T) = Voo(z;T) satisfy 
2z = (1+ 2m)r + (1+ 2n). 


Then 





eteniz = ern”). 


4] 


where we consider only positive m. Let q = e?" be as before, and consider the infinite 
product 


oO 


P(z;q)= [[ (0+ 2 e1 Hg 2 eis), (4.5) 
m=1 
Recall that an infinite product []?°_, fn of holomorphic functions on an open subset U 
of C represents a holomorphic function equal to lim y—oo Ths fn if for any compact 
subset K of U the series $57; (1 — fn) is uniformly convergent. 
Since |q| < 1, the infinite series 


oo 
ern y —2riz 
> q > pe} 
m=1 


is absolutely convergent for any z and the infinite series (4.5) is a holomorphic function 
on C. Its zeroes are the same as the zeroes of O(z,7). This implies that 








O(z,7) = 9oo(2s7) =Q) [J a+ qt erriz) (1 4 gua e 2m) 








Dq 2? @ 7); (4.6) 


-oT[0 


919(z57) = QqE (e miz tig) [[a ge mules Tal Age AMEN: (4.7) 


m=1 


V (z; T)= iQq5 (e miZ er) II (1 _ qe laa (Gl = ge (4.8) 


11 
22 


Plugging in z = 0 we get 








Vor 
= mat (4.9) 
919(0;7) = 2Qg8 [] (+4); 
m=1 
va ı (0; T)=0 
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(z; T) in z, we find 


Differentiating 011 

22 

i OO 
TANA E z my3 
v11 (0; T) = —20Qqe ][a-2*. (4.10) 
m=1 
To compute the factor Q we use the following: 
Theorem 4.1. /C. Jacobi]. 
Via = —TU V1 9Vg1- 
22 2 2 


Here, following the classic notation, we set 
Vab (03 T) = Vab, 


dalz; T) — ag 
dz (0) — “ab: 


Also notice that when (a, b) = (€/2,/2) where €, ņ = 0, 1 the classic notation is really 
Ben(z; T) = Denz; T). 

However we keep our old notation. 

Proof. Consider the space Th(2; A)qp with a,b = €/2,€ = 0,1. Its dimension is 2. If 


(a,b) = (1/2, 0), the functions volz; T)Voo(z;7) and O11 (2; T)Oo1 (z; T) belong to this 
space. It follows from (3.12) and (3.7) that 


D1 9(2 T), Yoo(Z3T); Do. (z;7) are even functions in z, 


011(z;7T) is an odd function in z. 
2.2 


Thus 01,9(2;7)Vo0(z;7) is even and 711(z;7)Ug1(z;7) is odd. Now consider the func- 
2 2 


11 
22 
tion 

F(z) = Oabl zi 7) Bary (Zi T) — Dabl zi TV Vaw (237). 


Observe that F(z) = Vablz; T)? (Var (2; 7) /Van(z;7))’. The function wee is almost 
periodic with respect to A, that is 





Bay (zt+M+n73T) _— onitm(a’—a)—n(b'—b)] Vary’ (257) 
=e AAE 
Valz +M+NT;T) Dabl z; T) 


This implies that F(z) € Th(2; A)z2a+a'—a,2b0+0' -b = Th(2; A)Ja+a' b+v.- In particular, 


1 (2; T): (2; T)— voz (2; 7)'91 1 (23 T) € Th(2; A)19. 
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Since this function is even (the derivative of an odd function is even, and the derivative of 
an even function is odd) we must have 





911 (257)'9q1 (257) — Do1 (25 7) V11 (257) = CD19(2;7)Vo0(z 7), (4.11) 
for some constant c. Since vı ı (0; 7) = 0, we get 
22 
v 1V91 
a 
1 9Vo0 


Differentiating (4.11) twice in z and plugging in z = 0, we obtain 


ag! Dor — gil / = V 
2 


11 1 11 
35 053 39 


Doz 9 19%00(V' 4900 + 1900). 


/ 
11 
22 


This gives 





Now we use the Heat equation 
8?Vabl z; T)(z, T) 
02? Or 


(see Exericse 3.10). This allows us to rewrite the previous equality in terms of derivatives 
in T. We get 





=0 (4.12) 





dlog vii dlog Voi V 10V00 
22 = 2 2 ‘ 
dr dr 
Integrating, we get 
8 1 = adı V1 9V00, 
22 2 2 


for some constant a. To compute a we use (4.14) when q = 0 (i.e. taking Im 7 go to 
infinity). This gives a = —7z. The theorem is proven. 














4.3 Now we are in business. Multiplying the equalities in (4.9) and comparing it with 
the equality (4.10), we obtain from the Jacobi theorem 4.1 


OO 


1 
~2nQq? II (1—qg™)? = a — T9019 19 = 


m=1 


—2nQ'qs [] (-a""2)? [] a+a""2)? [[ tan). 


m=1 m=1 m=1 
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This gives 





Here again we fix the sign in front of Q by looking at the value of both sides at q = 0. 


Replaing q with t? and using the obvious equalities 


(4.13) 


(4.14) 


Here comes our first encounter with one of the most notorious functions in mathematics: 


Definition. The Dedekind n-function is the holomorphic function on the upper-half plane 


defined by 


Thus Q = q1 ?n(T) and we can rewrite (4.5) in the form 


Boo = n(7) f(r)”, 
Jor = nir) (r)? 
vi = n(T)f2(T)? 


v = —2nn(7)?, 


Nin > 
Nir 


(4.15) 


(4.16) 


where 


q7i/48 To qg 5 
n=" TT 
m=1 
fo(r) = V2q/4 [] +a”) 
m=1 


They are called the Weber functions. 


4.4 Let us give some applications. 


We have z 
T 
Vool (z; T) =) e"l (2rz+ir? T) =) qZ", 
rEZ rEZ 


where q = e277, v = e?"". It follows from (4.4) and (4.9) that 


dolz) = [[Q-a™ +a = v(l+q 2 v)). 


Comparing the two expressions we get the identity 





SaPo = [[ 0- 04 0 g). 








rEZ m=1 
Here are some special cases corresponding to v = 1 and v = —1: 
r2 2 m 2m+1 2 
Soa? = [[Q-a™)ta® ), 
rEZ m=1 
r2 cae 2m=1\9 
X iya? = [[a-g@a-@= ). 
rEZ m=1 


To get more of this beautiful stuff, let us consider the function 711 
62 


have i 
— pTi/6 ,mir/12.q¢> l. = 
911 (0, 37) ele ws + 5387 )oo 
OO 
em/6 emit /12 Tl ern) {I (1 E emer Ory E eime — 


m=1 
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(4.17) 


(4.18) 


(4.19) 


(4.20) 


(4.21) 


(4.22) 


(0,37). By (3.19), we 
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CoO 
e7/6 e7iT/12 II (1 _ q”). 
m=1 
On the other hand, we have 


(0,37) = 5y eirl(m+4)?37+2(m+5)4] — 
mEZ 


0 


Ll 
62 


oo 
ett /6 717/12 `> (—1) "ere? +m)r — e7t/6 e7îT/12 > Kg). 
m=0 meZ 
This gives the Euler identity 


N (erg 6r — II (1 _ q”). (4.23) 
rEZ m=1 


In particular, we get the following Fourier expansion for the Dedekind’s function 7(7): 


nr) = g(a ger VP? 
rEZ 

The positive integers of the form n + (k pa ,n = 1,2,... are called k-gonal 
numbers. The number of beads arranged in the form of a regular k-polygon is expressed 
by &-gonal numbers. In the Euler identity we are dealing with pentagonal numbers. They 
correspond to the powers of q when r is negative. 

The Euler identity (4.23) is one of the series of MacDonald’s identities associated to 
a simple Lie algebra: 





oO 


X and = [0-an 


rEZ m=0 


The Euler identity is the special case corresponding to the algebra sl (2). 


Exercises 


4.1 Let p(n) denote the number of partitions of a positive integer n as a sum of positive 
integers. Using the Euler identity prove that 








p(n) — p(n — 1) — p(n — 2) + p(n — 5) +... + (—1)"p(n 
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Using this identity compute the values of p(n) for n < 20. 
4.2 Prove the Gauss identity: 


oo —1 oo 
2 i- Por) ( [[a = pan) = 9 grr +1)/2 
r=0 


n=0 


4.3 Prove the Jacobi identity: 


Jc _ z”)? = X (-1)"(2r a Herre 
n=1 r=0 


4.4 Using (4.2) prove the following identity about Gaussian sums: 


q-1 1 p-l ; 
gar ble = See ee 
pe =0 


=> 


va r=0 


Here p, q are two coprime natural numbers.[Hint: Consider the assymptotic of the function 
f(x) = O(0; ix + a when z goes to zero.] 
4.5 Prove the Jacobi triple product identity: 


[0-0ta bet) = ge”, 


rEZ 


3 
ll 
m 


4.6 Prove a doubling identity for theta constants: 
voz (27)? = voo(T)Vo1 (T): 


(see other doubling identities in Exercise 10.10). 


4.7 Prove the following formulas expressing the Weber functions in terms of the n- 
function: 


He) = 2/8 TS nr), h) =EN), tC) = Vaen). 


4.8 Prove the following identities connecting the Weber functions: 


DHIT lT) = fa (27)f2(T) = V2. 
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LECTURE 4. THETA CONSTANTS 


Lecture 5 


Transformations of Theta Functions 


5.1 Let us see now that the theta constants Vap and their derivatives De satisfy the 
functional equation similar to (4.2). This will imply that certain powers of theta constants 
are modular forms. For brevity we denote the lattice Z + TZ by A,. 


Theorem 5.1. Let 0(z;7) € Th(k; A-)ap and M = (: ) € SL(2, Z). Then 











a pC EP TE 

e 7 Carer eran (k; rab, 
where } LBS 

(a,b!) = (aa + qb- 2, Ba + 8b + 2) 


Proof. First observe that for any f(z) € Th({e,}; A) and t € C*, 
2 
(2) = f=) € Th({ey}; tA), 


where 


In fact, for any \’ = tÀ € tA, 





i z+tà Z z 
ole +N) = EZE) = FE A) = AÉ) = ex (Dole), 
We have 
Th(k; Ar)ab = Th({e,}; Z + TZ), 
where 


em+nr(2) = e2ti(ma—nb) .—mik(2nz+n?7) | (5.1) 
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For any M = 6 a € SL(2, Z) we have 


(y7+0)Z+(arT+h—)Z=Z+47Z. 
Thus for any V € Th(k; A,)ap, we have 
O(z(yr + 8)) € Th(e},;Z + 7’Z), 


where 





, at+ 6B 
i ; 
yT +6 


Emtnr (2) = €(mtnr')(yr-+4) (ZCIT + 8). 
We have, using (5.1), 


e(z) = Cyr 46(2(YT + 5)) Z e2ti(ad—by) o= Tik(2yz(yr+8)+7°7) — 


e Tiky((y7 +6) (2+1)?—(y7+6)27) orikyð 2ri(að—by) 
This shows that 
ero)" Th {ey (2)}; Z + 7/Z) = Th(feh(z)}sZ + 7'Z), 


where 
el (z) = eTilkyð+2(að—b7)] (5.2) 


Now comes a miracle! Let us compute e”, (z). We have 


erent err 20 6 aa llr +6) = 


etiky(yt+6)(227'+1'?) €gtar(z(yT + 6)) = (5.3) 


et tklyt +5) (2z27/+1'2)—(2az(yr+6)4 a?r) ,2mi(—ba+Ba)] 


e” (z) = 








= eit tkG (2ni(—ba+ Ba) 


where 

G = (qr + 6)(227' + T°?) — 2az(yr + ô) — a? = 
at+ 6B 
yt + 5? an 
(ar +9)” 

yT +6 
ylar + B)? — a*r(yr + ô) 
yT +6 


ar+ 6 
yT +6 





lyr + 4)(22z( )?) — 2az(yr + ô) — a?r = 


2zy(at + B) + — 2a2z(y7r + 6) — a?r = 





22-4 
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Here we used that ad — By = 1. Now 
ylar +8)? — a2r(yr +6) = yabr +78? — a?r = 
a(ad — By)r + aB(y7 + 6) — B(ad — By) = —(ar + B) +ab(yr + ô). 


This allows us to rewrite G in the form 


at taf = -2z -T + aß. 





G= 





Putting G back in the expression (5.3) we get 


e! (z) = e` Tik(2z+1') orilkaB—2(Ba—ab)] | 


T 


Together with (5.2) this shows that 
Th({e%(2)}, Av) = Th(k, Araw, 


where i k 

(a’,b') = (6a — 904 —Ba+ ab — $8), (5.4) 
Summarizing we obtain that, for any (z, T) € Th(k; A;)ap, 

elt h(t +8)2" 9 (yr + ô)z; T) € Th(k, A; )ay. (5.5) 
Now let us replace (: a ) with its inverse ( = 7 ). We rewrite (5.13) and (5.14) as 

eTinkyl -yT +0)2 9 (aye +a)z; T) € Th(k, Av Jay, (5.6) 

where a kó 

(a', b") = (aa + yb — “>, Ba + ôb + =) 





It remains to replace 7 with arts in (5.15) to obtain the assertion of the theorem. 














Substituting z = 0 we get 
Corollary 5.1. Let 3) (z,7),..., Ok(z; T) be a basis of the space Th(k; A-)ap and 
Vi (z, T), ..-, 0 (z; T) be a basis of Th(k; A) ay, where (a', b’) are defined in the Theo- 
rem. Then, for any M = & 3) € SL(2, Z) there exists a matrix A = (cij) E GL(k, C) 
depending on M and T only such that 


era) 
alo, a = Yat 
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5.2 Let us take k = 1 and (a,b) = (€/2,n/2),€&,ņ = 0,1. Applying the previous 
Theorem, we get 
Valz; T +1) = Cha bta: (2; T) 


for some C depending only on 7 and (a, b). In particular, 


Since the substitution 7 —> T + 1 changes q“ into e2mialt +1) — g%e274 we obtain 


C = e™/4, 


Similarly, using the formulas (4.16) and (4.17) which give the infinite product expansions 
for other theta constants, we find 


volz; T +1) = voz (z; T), (5.7) 
voa (2; T + 1) = voo(z; 7), (5.8) 
D1 9(23 T+1)= =E" tgah; T). (5.9) 


Now take M = (7 a) . We have 
e179 o(z/T; —1/r) = Bvoo(z;T) 
for some B depending only on 7. Plugging in z = 0 and applying (4.2), we get 
B = v-ir, (5.10) 


where the square root takes positive values on 7 € iR. 
In particular, 


Voo (0; —1/T) = vV —iTVo0(0; T). (5.11) 
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Applying the formula (3.19) we have 


1 1 1 





—inz?/T9 2, a a —inz?/Tġ 2 Z. — = 
E AO 2 i olta a 
eire Tgl” + 5 : => = Bet? NT einet) T(z J T> T) = (5.12) 
T T 2 
Bel™(/42) Bool + : 7) = B819(257). 
In particular, 
0(0; —1/7)o1 = y —irð(0; T)io (5.13) 


Replacing here 7 with —1/7, we obtain 


V(0; —1/7) 19 = Vir (0; T)o1. (5.14) 
This shows that o, 
e 17 9( 2/7; —1/T)19 = Vit 0(%T)o1- (5.15) 
Finally, using (5.13), (5.14) and (5.15) and applying the Jacobi theorem, we obtain 
0(0;-1/7)11 = -TV —i7 311 (0; PY (5.16) 
22 
We know from Theorem 5.1 that 
—inz?/r rom — Bp : — _f’ ; 
e O(z/r; I/r)a1 BO(z;7)1 2 Bo(z;7)11. 


for some constant B’ depending only on 7. Differentiating in z and setting z = 0 we 
obtain 


1 
—9(0;—-1/r)41 = BY 


1l 
22 


Comparing with (5.16), we get B’ = B and hence 
(5.17) 


5.3 We shall interpret the previous computations later by saying that powers of theta 
constants are modular forms with respect to certain subgroups of the modular group. Right 
now we only observe the following 


where ((M)§ = 1. 
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Proof. We shall prove in the next lecture that it is enough to check this for generators 
of the group SL(2, Z). Also we shall show that the group SL(2, Z) is generated by the 
matrices Mı = ($+), M2 = (9 >|) ,—J. We have from (4.14) and (4.15) 


f(r ++I =f), FOLT? = 7? f(r). 


This proves the assertion. 














Corollary 5.3. Let n(T) be the Dedekind n-function. Then 





wr =q][Q-a")*, qa" 
m=] 
satisfies 
aT + Boa 12 24 
(EE = Cor +8) nir. 
Proof. Use (4.10) 
94, = —2an(r)? 
22 














Corollary 5.4. Let M = a 5) € SL(2,Z). Assume that the products aß, y8 are 





ð 
even. Then 
u tP) = C(yr + 8) 2e" /0T +O (z; r) (5.18) 
qr +6 yr +6 E 


where CÈ = 1 and the branch of the square root is chosen to have non-negative real part. 
Proof. Recall that O(z;7) = Voo(z;T), so Theorem 5.1 gives immediately that 


z oar+p 
yT +6’ yrt+o 








) = e(M, r)e™17/07+9) Q(z; 7) 


for some constant c(M,T) depending only on M and r. Take M = (2 We Then 
formula (5.13) checks the assertion in this case. Take M = ( i t4 ). Then the assertion 
follows from (5.10). Now we argue by induction on |y| + |ô]. If |6| > ||, using that 
O(z,7 + 2) = 0(z; T), we substitute 7 + 2 in (5.16) to obtain that the assertion is true for 
M' = Q B+2a 

y 6424 
by induction. Note that we used that | + 27| is not equal to |ô] or |y| because (y, ô) = 1 
and qô is even. Now, if || < |y|, we use the substitution 7 + —1/7. Using (5.13) we 














. Since we can decrease | + 27| in this way, the assertion will follow 





see that the asssertion for M follows from the assertion fo M’ = (3 oe ). This reduces 














again to the case |ô| > |y]. 
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Exercises 


6-1 


5.1 Show that the constant ¢( M) in (5.16) is equal to i 7 (oy) 
odd. If y is odd and 6 is even, it is equal to e~7"7/ “(eh Here (3) is the Jacobi-Legendre 
symbol, where we also set (9) = 1. 


when y is even and ô is 


5.2 Extend the transformation law for theta functons by considering transformations de- 


fined by matrices Cc A ) with determinant n not necessary equal to 1: 





nbz" nz at+B6 
= ; 
qr +6 r+ 





) E€ Th(nk, Arab, 


where (z; T) € Th(k, A;)ay and 


kôß 


k 
(a,b!) = (aa + yb- Ba + 8b + >). 


5.3 Using the previous exercise show that 


(i) AD 11 (2; <j2)= Bon (z; T)O11 (2; T) for some constant A; 


(ii) ANI iolz; T/2) = voo(z; T)O14(23 T) for some constant A’; 
(iii) (Gauss’ transformation formulas 


919 (0; 7/2)011 (2; 7/2) = 2019(2;7)011(z;7), 


z 1 
2 2 


qs (0; 7/2)01.4(2; T2) = 2vo0(2;T)V10(2; T), 


[Hint: Apply (3.19) to get A = A’, then differentiate (i) and use the Jacobi theo- 
rem]. 


5.4 (Landen’s transformation formulas) Using Exercise 5.2 show 


91 (0; 27)d (23:27) = Dry(2i7)0 


11 
22 


qs (0; 27 )Vo1 (2z; 27) = Yoo(z;7)991 (257), 


1 
2 
5.5 Let n be an odd integer. 


(i) Show that, for any integer v, Vp 1 (4;7) depends only on the residue of v modulo 
n. 
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(ii) Show that 


n-1 F n-1 2y 
IETA = [0o Gan 


Gii) Using Exercises 5.3 and 5.4 show that 


voo(z; 27) 1o(2; 27)vo (2z;2r) = Voo(z; T)O1 9 (23 T)V 
Vo0(0; 27)01.9(0; 27)vo1 (0;2r) — 2oo(0; T)O14(0; TB 





(iv) Show that the expression 


Izi Pooli T) olp Toa (37) 
doo (0; 7)"-19 1.9(0; 7)" Mp1 (O37) 





does not change when 7 is replaced with 27. 
(v) Show that 
-1 
Lizi bolp 781973 T) Vo (RT) 7 
Voo (0; 7)" 1014(0; 7)" 1891 (0; 7)" k 





n—1 
Iži P00( kT) 19557) Po (F537) j 
Voo (0; T)” =1019(0; T)P Vga (One 


’ 





(-1)"F ( 
(vi) Prove the formula 


n-1 
Iži Pools T) oC 7) 801 (457) a 
-7)\n-1 .\n—1 .7\n-1 
Voo (0; T) 91 9(0; T) Yon (0; T) 








5.6 Let A = Zw + Zw. Set t(z; w1, w2) = vaalii a 
(i) Show that 
t(z + W13W1,W2) = —t(z; w1, w2), 
pg sewn 
t(z + w;w1, w2) = —e “1 t(Z;W, w2). 


(ii) Let wi, w, be another basis of A. Show that 


t(z; w1, wh) = Cet tozi; w1, W2) 


for some constants C, a, b. 
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Gii) By taking the logarithmic derivative of both sides in (ii) show that 








t” (0;w1, w2) t” (0; wi, wh) t” (0; w1, w2) 
a= } 4 = 7 
6t (0;w1, w2) 6t (0; w, w5) 2t' (0; wy, w2) 
and i 
C t (0; w1, w3) 
t (0; w1, w2) 
(iv) using (iii) show that 
m (0) UAN (0) 
a l 22 
60% ı (O)uz 69 (Ow? 
22 2.2: 


and b = 0; 


(v) using the Heat equation (see Exercise 3.8) show that 





ol”, (0) 
aa sip En 
O 


where 7 = 22., 
w 


5.7 Define the Weierstrass o-function by 
O01, w2) = ne ae res ú 
Show that 
(i) a(z; w1, w2) does not depend on the basis w1, w2 of the lattice A; 
(ii) o(—z) = —a(z); 
(iii) 


olz +w) = -enet a(z), olz + wy) = —e 4422) 9(z), 
where m = o'(w12)/a (w12); nz = o'(w22)/o (w22). 
Gv) (Legendre-Weierstrass relation) 
mw — Nw = 2ri. 


[Hint: integrate the function o along the fundamental parallelogram using (iii)]; 
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(v) 
_ mi dlogn(r) Ww dlog n(T) T 


m= — 2 = 
w? dr ’ w? dr 2w 





where T = w2/w1. 


5.8 Using formulas from Lecture 4 prove the following infinite product expansion of 


a(z; Wy, w2): 








2 oO —2 Mp2 
wy Me 2 d- gva - gv’) 
a(z; w1, w2) = mil Ose) II (1 — q™)? ; 
m=1 q 
a : 
where q = e? o v = eizh, 


Lecture 6 


Modular Forms 


6.1 We have seen already in Lecture 5 (5.2) and Corollary 5.3 that the functions 0(7)** = 
Yo(0;7)** (resp. n(7)?4) satisfy the functional equation 


f(r+2)= f(r), f(-l/7) = 77 f(r), 


(resp. 
tbs MUNT 


In fact, they satisfy a more general equation 

















ar +B 2k a p 
= T .l 
E orate) vV(o S) en 6.1) 
where T is the subgroup of SL(2, Z) generated by the matrices + (? 3!) ,+ (4?) (esp. 
£ (9!) £4). 
To see this we first rewrite (6.1) in the form 
f(g T)ig(t)* = F(T), (6.2) 
where 3 
: _ @& aT +P 9 
By the chain rule 
jagt(T) = 59 : T)jg' (T). (6.4) 


Thus replacing 7 with g’ - 7 in (6.2), we get 
(O: (I TNIE +7) (7) = F(99' T)gg lT) = F (7). 
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This shows that 
f(r)|eg = f(g naar (6.5) 


satisfies 
Felg) = (FTI), Yg, g' ET. 


In other words (6.5) defines a linear representation 
p:T > GL(O(H)™”) 
of T in the space of holomorphic functions on H defined by 


(9) ($(z)) = oleg}. (6.6) 


1 


Note that we switched here to g~~ in order to get 


p(g9') = p(g) © plg’). 


It follows from the above that to check (6.1) for some subgroup T it is enough to 
verify it only for generators of I. Now we use the following: 





Lemma 6.1. The group G = PSL(2,Z) = SL(2, Z)/{+} is generated by the matrices 


0 -1 1 1 
A A 
These matrices satisfy the relations 
Sai, (ST =1. 


Proof. We know that the modular figure D (more exactly its subset D’) is a fundamental 
domain for the action of G in the upper half-plane H by Moebius transformations. Take 
some interior point z9 € D and any g € G. Let G’ be the subgroup of G generated by 
S and T. If we find an element g’ € G such that g'g - zo belongs to D, then g'g = 1 
and hence g € G”. Let us do it. First find g’ € G” such that Im (g’ - (g - 2o)) is maximal 


possible. We have, for any g = & a 


Im z Im z 
lyr + 6| ~ Iyllz| + lô] 





Img: z= < CIm z, 


where C is a positive constant independent of g. So the set {Im g’-z : g' € G’} is bounded 
and discrete and hence we can find a maximal element. Take z = g - zo. Let g’ realize 
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this maximum. Applying transformations T” we may assume that |Re T” g'g - zo| < 5: If 
|T”g'g - zo| > 1 we are done since z’ = T” g'g - zo € D. If not, we apply S. Then 


/ 


—1 
Im S: 2’ = Im — = Im —, < Imz, 

zl |z"| 
contradicting the choice of g’. This proves the first assertion. The second one is checked 
by direct matrix multiplication. 














This explains why (6.1) is satisfied for the functions 64, and (7). 


Definition. Let T be a subgroup of finite index of SL(2, Z). A holomorphic 
(resp. meromorphic) function f : H — C satisfying 


ar+ 6 
yT +6 





= a 
(SEE) = (or +610) = apt), vo=(% FJ er, 
is called a weak modular form (resp. a weak meromorphic modular form) of weight k 
with respect to P. 


We shall later add one more condition to get rid of the adjective weak”. 


Remark 6.1. . Some authors prefer to call 2k the weight of a weak modular form admitting 
k to be equal 1/2. Since jg has a meaning for any group I acting discretely on a complex 
manifold M, our definition can be easily extended to a more general situation leading to 
the notion of an automorphic form of weight k. 


6.2 Suppose we have n + 1 linearly independent functions fo,..., fn satisfying (6.1) 
(with the same number k). Then we can consider the map 


f:H—>CP”", +> (fo(r),---,fn(7)- (6.7) 


When we replace 7 with eines the coordinates of the image will all multiply by the same 


number, and hence define the same point in the projective space. This shows that the map 
f factors through the map 








f : H/SL(2,Z) > CP”. 


Now recall that the points of H/SL(2, Z) are in a natural bijective correspondence with 
the isomorphism classes of elliptic curves. This allows us (under certain conditions) to 
view the set of elliptic curves as a subset of a projective space and study it by means 
of algebraic geometry. Other problems on elliptic curves lead us to consider the sets of 
elliptic curves with additional structure. These sets are parametrized by the quotient H/T 
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where T is a subgroup of SL(2, Z) of finite index. To embed these quotients we need to 
consider functions satisfying property (6.1) but only restriced to matrices from I’. 

Many examples of such functions are obtained from powers of theta constants. 

We will need one more property to define a modular form. It is related to the behaviour 
of f(z) when Im z goes to infinity. Because of this property the image of the map (6.7) is 
an algebraic variety. 


Let T be a subgroup of SL(2, Z) of finite index. We can extend the Moebius action of 
T on H to the set 
H* =HUQU {oo} = H U P! (Q) 


by requiring that the subset P!(Q) is preserved under this action and the group T acts 
naturally on it with respect to its natural linear action on Q?: 


6 5) ' (p,q) = (ap + bq, yp + ôq). 


In particular, if we identify rational numbers x with points (x : 1) € Pt(Q) and the 
infinity oo with the point (1,0) we have 








ax+ if ap: 
& ee vars | YE 640: (68) 
y ô oo if yx +ô = 0. 
a B ð 
(; 5) 7 
2 jf 0 
(e pes o ENS (6.9) 
y ô œo ify=0. 


Note that SL(2, Z) acts transitively on the set Q U {oo}. In fact for any rational number 
y= A with (p,q) = 1 we can find a pair of integers u, v such that up — vq = 1 so that 


(h)a 
.- = 0 
-q4 P q 


Thus any subgroup of finite index I of SL(2, Z) has only finitely many orbits on QU {oo}. 
Each such orbit is called a cusp of I’. For each cusp c = I- x of T represented by a rational 
number x or oo the stabilizer group Iy is conjugate to a subgroup of SL(2, Z). In fact, 
if g- x = co for some g € SL(2, Z), then 


Since 
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we have 





g Teg c {t(j 1) Bez} 


Let h be the smallest positive 6 occured in this way. Then it is immediately seen that 
g- Tz- g7} is generated by the matrices 


NG) Seal ea 


The number A is also equal to the index of the subgroup g - I, - g~! in SL(2, Z) = 
(7, —J). In particular, all x from the same cusp of T define the same number h. We shall 
call it the index of the cusp. Let f(T) be a holomorphic function satisfying (6.1). For 
each x € QU {oo} consider the function (T) = f (T)|Łg7t, where g - x = oo for some 
g € SL(2, Z). We have 


Pe 





IT) rg HST Ole grg HF Ose HSL Ole = olr). 


This implies that (rT) satisfies (6.1) with respect to the group gryg}. Since the latter 
contains the transformation T” we have 


O(T"- T) = $(7 +h) = (7). 


Thus we can consider the Laurent expansion of $(7) 


(D=> eq, qeaer™. (6.10) 


rEZ 


This converges for all q 4 0. We say that f(T) is holomorphic at a cusp (resp. meromor- 
phic) if ay = 0 for r < 0 (resp. a, = 0 for r < —N for some positive N). It is easy to 
see that this definition is independent of the choice of a representative x of the cusp. Now 
we are ready to give our main definition: 


Definition. A holomorphic (resp. meromorphic) function f(r) on the the upper half- 
plane H is called a modular form (resp. meromorphic modular form) of weight k with 
respect to a subgroup T of SL(2, Z) of finite index if it is holomorphic (resp. meromor- 
phic) at each cusp and satisfies 


FIT) =j TET), Vg Er. 


A modular form is called a cusp form or a parabolic form if its Fourier expansion at each 
cusp has no constant term. A meromorphic modular form of weight 0 is called a modular 
function with respect to I. 
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6.3 Let us give some examples. 
Example 6.1. Let 
A(T) = (7)™. 


It is called the discriminant function. We know that A(r) satisfies (6.1) with k = 6 with 
respect to the group [ = SL(2, Z). By (4.9) 


Since 


A(r)=¢[]@-4@")"* 
m=1 


we see that the Fourier expansion of A(7) contains only positive powers of q. This shows 
that A(T) is a cusp form of weight 6. 


Example 6.2. The function Yo9(7) has the Fourier expansion ) > q/?, It is convergent at 
q = 0. So oon is a modular form of weight k. It is not a cusp form. 


Let us give more examples of modular forms. This time we use the groups other than 
SL(2, Z). For each N let us introduce the principal congruence subgroup of SL(2, Z) of 
level N 


T(N) ={M = 5) E SL(2,Z): M =I mod N}. 


Notice that the map 


SL(2, Z) > SL(2, Z/NZ), é J-C 5) 


is ahomomorphism of groups. Being the kernel of this homomorphism, T(N) is a normal 
subgroup of I'(1) = SL(2, Z). I think it is time to name the group T (1). It is called the 
full modular group. 

We have 


Lemma 6.2. The group T (2) is generated by the matrices 


a Ce Bef E E 
4, =à} 3), srs- (2, 9). 


Proof. Let H be the subgroup of T (1) generated by T?, —I and ST? S71. We know that 
T(1) is generated by T and S, it is easy to verify that H is a normal subgroup of T (1) 
contained in ['(2). Since r(1)/T(2) = SL(2,Z/2) it suffices to show that the natural 
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homomorphism ¢ : [(1)/H — SL(2, Z/2) is injective. Let g € ['(1) \ H be an element 
of the kernel of ¢. It can be written as a word in S and T. Since 


(jag) eS, Gai, Ss S5, 


we can replace g with another element from the same coset modulo H to assume that g is 
a word in S and T where no S? or T? appears. Since we know that (ST)? = STSTST = 
1, we have the following possible expressions for g: 


S, ST, STS, SPST, STSTS, T. 
Here we used that (T.S) = 6(ST)~! since ¢(T? S?) = 1 and similarly 
o(TST) = (STS), (TSTS) = ¢(STST)', 


(TSTST) = o(STSTS)"1. 


Also (ST) = ¢(STST)~!. Thus it is enough to verify that the elements S, ST, STS, 
T are not in the kernel, i.e. do not belong to T (2). This is verified directly. 














Example 6.3. Consider the theta constants Des. Applying the transformation T’ = T + 1 
twice and using formulas (5.1) , we obtain 


Voo(T + 2) = oa (T +1) = Voo(r), 


Vo (7 + 2) = Voo(7 + 1) = v1 (7), 


Nie 


Dig(7 + 2) = el NDo(r +1) = cml? 16(). 
Next, using formulas (5.11)-(5.14), we have 


9o0(ST?ST) = Boo(( 5! 2,)-7) = doo es) = g a *oo( +2) = 


; 1 1 
d o + 2)" vo(-2) = 


f 1 . 
ma ma + 2)!/2(T)!/2900(7) = —i(2r = 1)t/20o0(7). 


E 
Similarly we obtain 


: 1 1 
vo; (ST? ST) = ee + 2)"7919(-= +2) = 


ieil L 4 2)120, (F) = (2r — 1), (7), 
T 2 T 2 
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Vio (ST?S7) = —i(27 — 1)? (7). 
Applying Lemma 6.2, this shows that 
4 4 4 
Boo(t)", Violr),  Vo2(7) (6.11) 


are weak modular forms with respect to the group I'(2). This group has three cusps 
represented by 0, 1, and oo. Since P is generated by the matrices +7”, we see that oo 
is the cusp of T (2) of index 2. Since the subgroup ['(2) is normal in I'(1) all cusps have 
the same index. Also it is enough to check the condition of holomorphicity only at one 





cusp, say the oo. By formula (4.6) Vap(T)* has infinite product in q? = e*’T with only 
non-negative powers of q. Thus the functions (6.11)) are modular forms of weight 1 with 
respect to T (2). Since 


we see that 94 r is a cusp form. 
2 


6.4 We know that any elliptic curve is isomorphic to a Hesse cubic curve. Let us give 
another cubic equation for an elliptic curve, called a Weierstrass equation. Its coefficients 
will give us new examples of modular forms. Recall that dim Th(k, A+)ab = k. Let use 
<, > to denote the linear span. We have 


Th(2, A+) =< T?, X' >, 
Th(3, A-)11 =< T’, TX',Y' >, 
22 


for some functions X’ € Th(2, A+), Y” € Th(3, A7) 
tions 


11. Now the following seven func- 
22 


TS TAX TXA XS TY TXY Y” 


all belong to the space Th(6, A+). They must be linearly dependent and we have 


OL 4b A + cT? X”? + dX” + eT?Y' IT KR oy 7 SU. (6.12) 
Assume g Æ 0,d Æ 0. It is easy to find 


X =aX4+6T?, Y =7Y'+8XT +wT’ 
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which reduces this expression to the form 
Y?°T — X? — AXT’ — BT’ = 0, (6.13) 
for some scalars A, B. Let 
e(z)=X/T*, gr(z) = Y/T’. 
Dividing (6.13) by T° we obtain a relation 
o1(z)? = p(z)? + Ap(z) + B. (6.14) 


Since both X and T? belong to the same space Th(2, 7) the functions p(z), pı(z) have 
periods À € Z + TZ and meromorphic on C. As we shall see a little later, 9, (2) = S®. 
Consider the map 


E, =C/A> P’, z> (FC), FOX), (2)). 


Since T(z)’, T(z)X (2), Y (z) all belong to the same space Th(3, A+) 1 1 this map is well- 
defined and holomorphic. It differs from the map from Example 3.2 only by a composition 
with a translation on Æ, and a linear change of the projective coordinates coordinates. 
This is because, for any f € Th(k; A) we have 


erilarr +2(z+a)b] f(y a 2+ ar) € Th(k; A)ap 


(see Lecture 3). So it is an isomorphism onto its image. The relation (6.13) tells us that 
the image is the plane projective curve of degree 3 given by the equation 


yt — r? = Art? — BÈ =%, (6.15) 


Now it is clear why we assumed that the coefficients d, g in (6.12) are not equal to zero. If 
g = 0, we obtain an equation f(z, y,t) = 0 for the image of E, in which y enters only in 
the first degree. Thus we can express y in terms of x, ¢ and obtain that Æ, is isomorphic 
to P!(C). If d = 0 we obtain that f could be chosen of degree 2. Again this is impossible. 
Note that we also have in (6.13) 


4A? + 27B?. 40 (6.16) 


This is the condition that the polynomial z? + Ax? + B does not have a multiple root. If 
it has, (6.13) does not define a Riemann surface. A cubic equation of the form (6.15) with 
the condition (6.16) is called a Weierstrass cubic equation. 


68 LECTURE 6. MODULAR FORMS 


We know from Lecture 3 that T = J1i(z;7) has simple zeroes at the points z = 
22 


à € A. Since X does not vanish at these points (it is a linear combinations of T? and 
Voo(z; T)?), p(z) has poles of order 2 at z € A. Differentiating (6.14), we obtain 


201(z)1(2)' = (3(z)? + A)p(z). 


Let 91 (21) = 0. If 39(21)?+ A = 0, the polynomial x? + Ax + B is reducible since p(21) 
must be its double root. So, p1 (z) has common roots with p(z)’. Now both functions have 
a pole of order 3 at points from A. This shows that the function 9)/,’ has no poles and 
zeroes, hence it is constant. Let cf, = p. Replacing p1 by c?@1, p by cp, A by c*A, B 
by c°B we may assume that 

1(z) = p(z}. (6.17) 
Let 

p(z) = a227? + aoz7 +... 


be the Laurent expansion of p(z) at 0. Note that p(z) must be an even function since all 
functions in Th(2, A,) are even. We have 


oi(z) = plz) = —2a_o2z77 + Qaoz+.... 


Plugging in the equation (6.11) we obtain 4a? = a, hence a_2 = 4. Finally, if we 
replace p(z) with o(z)/4 we can assume that 


olz) = 27? + e22? + caz*+..., (6.18) 


and 
plz)? = 49(z)* — g2p(2) — 93- (6.19) 


Here we use the classical notation for the coefficients of the Weierstrass equation. Differ- 
entiating (6.18) we find 


plz) = 9g? + 2cəz + Acaz° aE teeta (6.20) 
Plugging this in the Weierstrass equation (6.19), we easily get 
plz)? — 4p(z)? = —20c22~? — Beg + 27(...). 


Thus the function p(z)? — 4(z)? + 20c2¢(z) + 28c4 is holomorphic and periodic. It 
must be a constant. Since it vanishes at 0, it is identical zero. Comparing this with the 
Weierstrass equation, we find that 


g2 = 20c2, g3 = 28c4. (6.21) 
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After all of these normalizations, the elliptic function (z) with respect to A, is 
uniquely determined by the conditions (6.18) and (6.19). It is called called the Weier- 
strass function with respect to the lattice A+. 

One can find explicitly the function ¢(z) as follows. I claim that 


p(z) = (z) = 5+ L oF -5 (6.22) 


Pea 


First of all the series (6.22) is absolutely convergent on any compact subset of C not 
containing 0. We shall skip the proof of this fact (see for example [Cartan])[?]. This 
implies that ¢(z) is a meromorphic function with pole of order 2 at 0. Its derivative is a 
meromorphic function given by the series 


AL 





It is obviously periodic. This implies that ¢(z) is periodic too. 
Since $(z) is an even function, ¢(z)’ is odd. But then it must vanish at all A € 4A. In 
fact 


$'(—A/2) = —$'(A/2) = —'(—A/2 + A) = —9'(A/2). 


The same argument shows that p(z)" vanishes at the same points. It follows from the 
Cauchy residue formula that the number of zeroes minus the number of poles of a mero- 
morphic double periodic function inside of its fundamental parallelogram is equal to zero 
(see computations from Lecture 3). This shows that ¢’ and g’ has the same set of zeroes 
and poles counting with multiplicities. This ee that ¢'(z) = cp(z)’ for some con- 
stant c. Now comparing the coefficients at z7’ we see that c = 1. So p(z) = ¢(z). 
After integrating we get p(z) = ¢(z)+constant. Again comparing the terms at z7? we 
get (z) = p(z). This proves (6.22). 
After differentiating p(z) at 0 we obtain 


C2 = 3 5 7 C4 =9 > 


AEA\ {0} AEA\ {0} 





Remark 6.2. Now it is time to explain the reason for the names “elliptic functions” and 
“elliptic curves”. We know that the Weierstrass function ¢(z;7) is a solution of the dif- 
ferential equation ($2)? = 4a? — gox — g3. Thus the function z = p7! (zx) is given, up to 


adding a constant, by the indefinite integral 


d 
z= f as (6.23) 
4x3 — gox — 93 
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This is called an elliptic integral. Of course, the function x = (z) does not have single- 
valued inverse, so one has to justify the previous equality. To do this we consider a 
non-empty simply connected region U in the complex plane C which does not contain the 
roots e1, €2, e3 of the polynomial 4x? — goa — g3. Then we define f : U > C by 





fu) = [ = 
“We y 4x3 — gox — 93 


This is independent of the path from u to oo since U is simply connected. Using analytic 
continuation we obtain a multivalued holomorphic function defined on C \ {e1, e2, e3}. 
Using the chain rule one verifies that p(f(u)) = +u. So, f is well-defined as a holomor- 
phic map from C \ {e1, €2, e3} to (C/A,)/(z — —z). It can be shown that it extends 
to a holomorphic isomorphism from the Weiertrass cubic y? = 4z? — goz — g3 onto 
(C/A) \ {0}. This is the inverse of the map given by z + (p(z), e(z)’). As was first 
shown by Euler, the elliptic integral (6.23) with special values of g and g3 over a special 
path in the real part of the complex plane x gives the value of the length of an arc of an 
ellipse. This explains the names “elliptic”. 





6.5 Next we shall show that, considered as functions of the lattice A = Z + Zr, and 
hence as functions of 7, the coefficients g2 and g3 are modular forms of level 4 and 6, 
respectively. Set for any positive even integer k: 


AEA,\{0} 


Assume |T| > R > O and k > 2. Since 


1 
NEZ+7Z\ {0} lztiy|>R 


/ 
R 
we see that F;,(7) is absolutely convergent on any compact subset of H. Thus E,(7) are 
holomorphic functions on for k > 2. From (6.21) we infer 


2T oo 
rti drd = anf ti 
0 R 


g2 = 60F4, 93 = 140 Eg. (6.24) 


We have 








Ort By _. at +B ae 
arr Dla) tm 7 
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(yr + 5)* [((ma + ny)r + (mB + nô) E = (yr + 6)* Ey (7). 
(m,n) #0 
This shows that £,(7) is a weak modular form with respect to the full modular group 
T(1). We can also compute the Fourier expansion at the cusp oo. We have 


1 1 
E = — —— }. 
= E at D (Ega) 
mEZ\{0} nEZ\{0} `mEZ 
Since k is even, this can be rewritten in the form 
=2 B = anm es 
meN nEN (2.7 neN (2.7 


where i 
MS Done Res >1 
mEN 
is the Riemann zeta function. Now we use the well-known formula (see for example 
[Cartan], Chapter V, §2, (3.2)): 


OO 








m cot(mz) = D (z+m)t. 
mEZ 
Setting t = e?”’*, we rewrite the left-hand side as follows: 
cosmz 3 eae hab siege se Z a 
mT cot(mz) = Toes ES Tae =U n 1-2 5 E: 


Differentiating k — 1 > 2 times in z, we get 


co co 
(k—1)! PC +m) (Qri)* es Fm ee 
meZ m=1 


This gives us the needed Fourier expansion of E;,(7). Replace in above z with nr, set 
q = e?™" to obtain 


Ex( 





5 mtg"), (6.25) 
a ! m=1 


It is obviously convergent at q = 0. So, we obtain that E;,(7) is a modular form of weight 
k/2 with respect to the full modular group T (1). It is called the Eisenstein form of weight 
k/2. Recall that k must be even and also k > 4. One can rewrite (6.21) in the form 


E,(r) = 2¢(k) a ole on—1(m)q”, (6.26) 
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where 


(m) = 5 d” = sum of n“ powers of all positive divisors of m. 


Now we observe that we have 3 modular forms of weight 6 with respect to (1). They 
are g3 = 60° E3, g3 = (140)? Eè, A. There is a linear relation between these 3 forms: 


Theorem 6.1. 
(2r)?A = g3 — 2793. 


Proof. First notice that g3 — 27g is equal to the discriminant of the cubic polynomial 
oa — g2x — g3 (this is the reason for naming A the discriminant). Thus the function 
— 2793 does not vanish for any T € H. Since A is proportional to a power of 0 1a and 


i: latter does not vanish on H (because J 11 (z; T) has zero of the first order at 0), we see 


that A also does not vanish on H. Now consider the ratio g3 = 2793 /A. It has neither 
zeroes nor poles in H. Let us look at its behaviour at infinity. Let 


OO OO 
x-V oln), Y= Y oln)" 
n=1 n=1 


We use the well-known formula (see, for example,[Serre][?]), ) 
¢(2r) = ——_ Bo, x", 
r)! 


where B; are the Bernoulli numbers defined by the identity 


oO 


2i 
£ £ ; T 
=1 ' YHB -—, 
et —1 2 2 ) * (2i)! 


i=1 





In particular, 120¢(4) = (27)4/12, 280¢(6) = na and we can write 
1 1 Y 
= (2n)*[— + 20X = (2 s=, 
This gives 
3 ed 12 2 3 2 
9 — 2792 = (2m) 2[(5X + 7Y )/12 + 100X? + 20X? — 42Y?] = 


(2n)!2q + @(...). 

Now from Example 6.1 we have A(T) = q+ q?(...). This shows that the ratio R = 
—2793/A is holomorphic at oo too. This implies that R is bounded on the fundamental 
domain D of ['(1). Since R is invariant with respect to [(1) we see that R is bounded 
on the whole upper half-plane. By Liouville’s theorem it is constant. Comparing the 
coefficients at g, we get the assertion. 
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6.6 Recall that we constructed the modular forms g2 and g3 as the coefficients of the 
elliptic function p(z; T) in its Taylor expansion at z = 0. The next theorem gives a 
generalization of this construction providing a convenient way to construct modular forms 
with respect to a subgroup of finite index T of SL(2, Z). 


Theorem 6.2. Let (z; T) be a meromorphic periodic function in z with respect to the 
lattice A, = Z + Zr. Assume that, as a function of T, it satisfies 


z aTrt+pB 
yr yr +6 








D( 5 


)= (yr + 6)" ®(z;7), V (° A eT cI(1) 

Let gn(T) be the n-th coefficient of the Taylor expansion of ®(z;7) at zo = £T + y for 
some x,y E€ R. Then 

at+ 6B 


YT + 5? = (YT F 8) tgn (T), 





Inl 
for any M € SL(2, Z) such that (x',y') = (x,y) +- M = (x,y) mod Z?. 


Proof. Use the Cauchy formula 








at +B 1 f (z + wits + y; te) 
rA 


TEE OR gut 











In( 




















z(y7+6)+a(at+B8)+y(yT+4) , ar+ 
1 re (Tó) ee u(y ). os: 7 
2ri geil 
1 
Oni ®(2(yr + 6) + zlar + B) + y(yr +8); 7)(yr + 8)” t dz = 
1 fa +a'r ty sr)(y7 +s)” 
: dz = 
271 grt 
1 joer 
: dz. 
2ri grt 


here we integrate along a circle of a small radius with center at 0 in a counterclockwise 
direction. 
After substitution z(y7 + 6) = 2’, we obtain 


at+ 6 
yT +6 





Jnl ) = (yT + 6)" gn (T). 














74 LECTURE 6. MODULAR FORMS 


Example 6.4. We apply the previous theorem to ®(z;7) = p(z) and z = 3. In this case 


r=ro(2)={(% $) er: 2h}, 


Now, replacing z with z/(y7 + ô) in (6.22), we get 


z ar+ß 
YTO yr +6 


9( 











1 
= 27-2 
pero me z—m(yr + 6) nearby 


Since Z + Zr = Z(y7r + ô) + Z(ar + 6), we get finally that 


z or+p 
yr yr +6 








ol ) = (7 + 6)’ p(z; T). 


Thus (z; T) satisfies the assumption of the Lemma with m = 2. Let M € T(1). Since 


(0,4): M — (5,0) € Z? if and only if M € To(2) we obtain that the 0-th coefficient 


go(T) = (5) of the Taylor expansion of p(z) at 5 satisfies 








5 EE) = 7 + Po?) 


elz yr+ô 2 


Similarly, if we replace 5 with 5 and 5 + 5 we get that 











eG AE) = artel v(% 5) ere), 
where 
roa) =4(% §) eroa (4 roh T) 
oG +5: mrt) = Or + 8 (5 +557). 


We skip the verification that (5) and ($) satisfy the regularity condition at the 
cusps. Since both To(2) and T? (2) contain T (2) as its subgroup, we see that 


Crn 
PU Ta 


are modular forms of weight 1 with respect to T (2). 
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Exercises 


6.1 Show that p(z) is a time independent solution of the Kortweg-de Vries partial differ- 
ential equation 
Ut = Ugee — 20, u = U(z, t). 


6.2 Compute the first two coefficients cg, cg in the Laurent expansion of p(z). 
6.3 Show that o(z) = si log v1 ı (z; T)+constant. 
22 


6.4 Let E, \ {0} — C? be the map given by z + (p(z), @(z)’). Show that the images of 
the non-trivial 2-torsion points of Æ, are the points (a;,0), where a; are the zeroes of the 
polynomial 42° — gox — g3. 


6.5 Show that 
p(z1) g(a) 1 
det | o(z2) g’(z2) 1] =0 
re(z3) g'(z3) 1 


whenever z1 + z2 + z3 = 0. Deduce from this an explicit formula for the group law on 
the projective cubic curve y7t = 4x? — goxt? — ggt?. 


6.6 (Weierstrass ¢-function) It is defined by 
Z(zA) == 4 5 (= +2+3). 
weEA\{O} 
Let A = Zw, + Zw. Show that 
(i) Z"(z) = —@(2); 
(ii) Z(z + wi) = Z(z) + m,i = 1,2 where ni = Z(w;/2); 
(iii) mw — nmw = 2m; 
(iv) Z(Az;A+ A) = A7!Z(z;-A), where 2 is any nonzero complex number. 
6.7 Let ¢(z) be a holomorphic function satisfying 
(2)'/(z) = Z(2), 
(i) Show that ¢(—z) = —¢(z); 
G) A(z + wi) = enet H (2); 


(iii) (z) = a(z), where o(z) is the Weierstrass o-function. 
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6.8 Using the previous exercise show that the Weierstrass o-function o(z) admits an in 
finite product expansion of the form 


we A\{0} 


which converges absolutely, and uniformly in each disc |z| < R. 


6.9 Let E, be an elliptic curve and y? = 4a? — gax — g3 be its Weierstrass equation. Show 
that any automorphism of Æ- is obtained by a linear transformation of the variables (x, y) 
which transforms the Weierstrass equation to the form y? = 42° — ctgax — c°g3 for some 
c # 0. Show that E, is harmonic (resp. anharmonic) if and only if gg = 0 (resp. go = 0). 
6.10 Let k be an even integer and let L C R* be a lattice with a basis (e1, . . . , €). Assume 
that ||v||? is even for any v € L. Let D be the determinant of the matrix (e; - ej) and N be 
the smallest positive integer such that N||v*||? € 2Z for all v* € R* satisfying v* -w € Z 
for all w € L. Define the theta series of the lattice L by 


brl) =X Ho € L: |lu]? = Inperrn7, 
n=0 
(i) Show that 8z (7) = Dep eM Ml; 


(ii) Show that the functions ©(0, 7)* discussed in the beginning of Lecture 6 are special 
cases of the function 0z. 


(iii) Show that 0z (T) is “almost” modular form for the group 


T(N) = (£ J € SL(2, Z) : NIc}, 


ar+ 6 
yr +6 


Or ( 





j= rtada), (% 5) Pol), 


k 
where x(d) = (UF?) is the quadratic residue symbol. 





(iv) Prove that 0z (T) is a modular form for ['9(2) whenever D = 1 and k = 0 mod 4. 


6.11 Let ®(z; T) be a function in z and 7 satisfying the assumptions of Theorem 6.2 (such 
a function is called a Jacobi form of weight m and index O with respect to the group I). 
Show that 


(i) 9(z;7) is a Jacobi form of weight 2 and index 0 with respect to ['(1); 
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(ii) o(z;1,7) is a Jacobi form of weight 1 with respect to T (1). 


6.12 Let n be a positive integer greater than 2. Consider the map of a complex torus 
E, \ {0} > C” given by the formula 


z > (1, 9(z),..., (2)? , p(z) 


if n is odd and 


z > (1, (z),... p(z) 


if n is even. Show this map extends uniquely to a holomorphic map fn : E —> P”. Show 
that fn is an isomorphism onto its image (a normal elliptic curve of degree n). Find the 
image for n = 4. 


6.13 Let q = e277, v = e?™, 


(i) Show that the function 





1 1 1 
kA 1 1 e Do eee 
a (q"/2v2 ca q7t/2y72)? 12 eer (qr/ =o t/ ) 


coincides with ¢(z). 


(ii) Using (i) show that (z; 7) considered as a function of 7 has the following Fourier 
expansion 


agp 37) = aa tA G let + oa" 


i ad 
(v2 =v 2)? n=1 d|n 


5 (1- mS ai(n)q”). 
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LECTURE 6. MODULAR FORMS 


Lecture 7 


The Algebra of Modular Forms 


7.1 Let I be a subgroup of finite index of I'(1). We set 
Mg(T) = {modular forms of weight k with respect to T}, 


We also denote by M;(I°)° the subspace of cuspidal modular forms. It is clear that 
M,(T) is a vector space over C. Also multiplication of functions defines a bilinear map 


My (T) x Mi(T) > Mp4). 


This allows us to consider the direct space 


MT) = @ Mill) (7.1) 


k=—0o 


as a graded commutative algebra over C. Since M;,(T)1M,(L) = {0} if k 4 1, we may 
view M (T) as a graded subalgebra of O(H). 
Notice that 


MT) = a Mx(P)! (7.2) 


k=— o0 
is an ideal in M (T). 
We shall see later that there are no modular forms of negative weight. 


7.2 Our next goal is to prove that the algebra M (T) is finitely generated. In particular 
each space M(T) is finite-dimensional. 


Let f(z) be a meromorphic function in a neighborhood of a point a € C and let 


OO 


fzy= 5 Cn(z — a)” 


n=m 
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be its Laurent expansion in a neighborhood of the point a. We assume that cm # 0 and 
set Va( f) = m. We shall call the number va( f) the order ( of zero if m > 0 or of pole if 
m <0) of f at a. If f is meromorphic at oo we set 


Voo(f) = vo( f(1/z)). 


Note that when f is a modular form with respect to a group T we have 


Vg.7(f) =v (f), YgEr. 
For each r € H let 


2 ifrer(1)-i, 
m= 43 ifr eri) e3, (7.3) 
1 otherwise. 


Lemma 7.1. Let f(T) be a modular form of weight k with respect to the full modular 
group T (1). Then 


TEH/T(1) 


Proof. Consider the subset P of the modular figure D obtained as follows. First delete 
the part of D defined by the condition Im 7 > h for sufficiently large h such that f has 
no zeroes or poles for Im 7 > h. Let C, (p), C;(p2), C (i) be a small circle of radius r 
centered at p = e™/3 at p° and at i, respectively. Delete from D the intersection with each 
of these circles. Finally if f(z) has a zero or pole a at the boundary of D we delete from 
D its intersection with a small circle of radius r with center at a. 


Fig.1 
Applying the Cauchy Residue Theorem we obtain 


1 f'dr v-(f) 
mf P= Deny 
miJop f TEP rep ™ 
When we integrate over the part OP, of the boundary defined by Im 7 = h we obtain 
1 f'dz 
271 OP; f 





= —Vo0(f). 
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In fact, considering the Fourier expansions of f at oo, we get 


f(r) = a ane T, 
n=væ(f) 
fGy= Fi (2rinjane 
n=væ(f) 


Use the function q = e?*'7 to map the segment {r : |Re T| < 4,Im 7 = h} onto the 
circle C : |q] = e~?™". When we move along the segment from the point 5 + ih to the 
point -4 + ih the image point moves along the circle in the clockwise way. We have 


1 f'dr _ 1 J (Qrivos(f)qve) +...)dq 
ani Jap, f =~ mijo 2riqlay (pP) +...) 








= —væ( f). 


If we integrate along the part 0 P2 of the boundary of P which lies on the circle C, (p2) 


we get 
f'dr 1 
= gue lf). 


This is because the arc OP approaches to the one-sixth of the full circle when its ra- 
dius goes to zero. Also we take into account that the direction of the path is clockwise. 
Similarly, if we let ôP = OP N C, (i), OP, = OPN C;(p), we find 


lim - 
r=>0 2ri OP» f 











1 
1 
lim — fdr = —-1y;,(f). 
r>0 2ri Jap, f 2 
1 : 1 
lim — fee = —-v( f). 
r0 2ri Jop, f 6 


Now the transformation T : 7 — 7 + 1 transforms the path along oP from -4 + ih to 
p° to the path along the boundary from the point p to the point $ + th. Since our function 
satisfies f(T + 1) = f(T) and we are moving in the opposite direction along these paths, 
the two contributions to the total integral cancel out. Finally, if we consider the remaining 
part of the boundary, and use the transformation S : T > -1 we obtain 


df(=) | d(r?* de d 
ia) (TF (7) _ 4,47 , of 





F>) f(r) T f’ 


When we move from p° to i the point S -r moves from p to i. This easily gives us that the 
portion of the integral over the remaining part of the boundary is equal to (when r goes to 


zero) 
1 [= - 2k( Dat 
Y 





211 T 12 
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where y is the part of the circle 7 = 1 starting at p? and ending at i. Collecting everything 
together we obtain the assertion of the lemma. 














Theorem 7.1. M;,(T(1)) = {0} ifk < 0. Ifk > 0, we have 


amis Ne if k = 1 mod (6) 
[k/6]+1 otherwise. 

Proof. Let f(r) € Mk(T(1)). Then v, > 0 for all 7 € H, and Lemma 7.1 implies 

that ; = A4 B Ç for some non-negative integers A, B,C’. Clearly this implies that 

dim M(T(1)) = {0} when k < Oor k = 1. If k = 2 we must have A = B=0,C = 1. 

Since f € Mo2(I(1)) we have 





In particular, this is true for gz. For any other f € Mo(I(1)) we have f/g2 is (1) 
invariant and also holomorphic at co (since gə is not a cusp form). This shows that f /g2 
is constant and 


M2(T(1)) = Cg2. 


Similar arguments show that 


Ma(T(1)) = Cas, 
Ma(P(1)) = C93, 
Ms (T(1)) = C9293. 


This checks the assertion for k < 6. Now for any cuspidal form f € M,(I(1)) with 
k > 6 we have f /A is a modular form of weight k — 6 (because A does not vanish on H 
and has a simple zero at infinity). This shows that for k > 6 


Mx(P(1))° = AMz-6(T(1)). (7.4) 
Since M;(I'(1))/M;(I(1))° & C (we have only one cusp) we obtain for k > 6 


dim M,,(I'(1)) = dim M,_6(P(1)) + 1. 














Now the assertion follows by induction on k. 


Corollary 7.1. The algebra M(T(1)) is generated by the modular forms g2 and g3. The 
homomorphism of algebras @ : C{T,, T2] + M(T(1)) defined by sending T; to g2 and T> 
to g3 defines an isomorphism between M(T(1)) and the algebra of complex polynomials 
in two variables. 
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Proof. The first assertion is equivalent to the surjectivity of the homomorphism ¢. Let us 
prove it. We have to show that any f € M(T(1)) can be written as a polynomial in g2 
and g3. Without loss of generality we may assume that f € M(T(1))ẹ for some k > 0. 
Write k in form k = 2a + 3b for some nonnegative integers a and b. Since g$g does 
not vanish at infinity, we can find a constant c such that f — cg$ g is a cuspidal form. By 
(7.4), it is equal to gA for some g E€ M(I'(1))x—6. Since A is a polynomial in g2 and g3, 
proceeding by induction on k we prove the first assertion. To prove the second assertion 
we use that any element F (T; , T2) from the kernel of ¢ can be written uniquely as a sum 
of polynomials Gj satisfying 


GalT’T, Tt3) = TÎGa(T, Tə) 


for some d > 0 and any 7 € H. In fact, writing F as a sum of monomials in 7), 7> we 
define G4 as the sum of monomials T/73 entering into F such that 2i + 3j = d. Since 


F(g2(—1/7), 93(—1/r)) = F (1792, 7°93) = 0, 


each Gq must belong to the kernel of ¢. This allows us to assume that F = G4 for some 
d. Dividing by T we obtain Ga(g2, 93)/9$ = G(g3/g93) = 0 for some polynomial G in 
one variable T = T\/T>. Since C is algebraically closed, g3/g% must be a constant. But 
this is impossible since g3 vanishes only at T - i and gz vanishes only at T (1) - p. 














Corollary 7.2. The ideal of cuspidal modular forms M? (T (1)) is generated by A. 


Proof. We have seen already in (7.4) that M;(T'(1))° = AM;,_¢(I'(1)). Also we have 
Mg(T(1))? = {0} for k < 6. This checks the assertion. 














7.3 Let us give some examples. 


Example 7.1. We know that the Eisenstein series Eo; is a modular form of weight k with 
respect to ['(1). Since M4(T(1)) = Cg} = CE?, comparing the constant coefficients in 
the Fourier expansions we obtain 





((8)_p2 


m= a 


Comparing the other coefficients we get a lot of identities between the numbers o;,(7). 
For example, we have 


or(n) = 03(n) + 120 — oz3(m)oz(n — m). (7.5) 
0<m<n 
Similarly we have 


Ex. Es. 
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This gives us more identities. By the way our old relation 
(27) 7A = g3 — 2793 


gives the expression of the Ramanujan function T(n) defined by 





m=1 n=0 
in terms of the functions o;(n): 
65 691 691 
T(n) = zgra f z565) 3 `> os(m)os(n — m). (7.6) 


0<m<n 


We shall prove in Lecture 11 that 7(7) satisfies 
ram) =7(n)r(m) if (n,m) = 1, 


ktt) paN 


T(p = (p)T(p*) — p!tr(p if p is prime, k > 0. 


Example 7.2. Let L be a lattice in R” of rank n such that for any v € L the Euclidean norm 
||v||? takes integer values. We say that L is an integral lattice in R”. If (v1,..., Un) isa 
basis of A, then the dot products a;; = v; : vj define an integral symmetric non-degenerate 
matrix, hence an integral quadratic form 


n 
Q = 5 QijLilj. 
ij=1 
Obviously for any v = (a1, .. . , an) Æ 0 we have 
Q(v) = |le||? > 0. 


In other words, Q is positive definite. Conversely given any positive definite integral 


quadratic form Q as above, we can find a basis (e},. . . , e/,) such that Q diagonalizes, i.e. 
its matrix with respect to this basis is the identity matrix. Let ¢ : R” — R” be the linear 
automorphism which sends the standard basis (e1, ... , €n) to the basis (e},...,e/,). Then 


the pre-image of the standard lattice Z” = Ze, +... + Zen is an integral lattice L with 
the distance function Q. 
Let us define the theta function of the lattice L by setting 


iG) => rma" = GO, (7.7) 


m=0 vEL 
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where 
rL(m) = #{v € L : Qw) = 2m}. 

(see Exercise 6.10). Since rz(m) < (2m)"/? (inscribe the cube around the sphere of 
radius \/2m), and hence grows only polynomially, we easily see that 0z (T) absolutely 
converges on any bounded subset of H, and therefore defines a holomorphic form on H. 

We shall assume that L is unimodular, i.e. the determinant of the matrix (a;;) is equal 
to 1. This definition does not depend on the choice of a basis in L and is equivalent to the 
property that L is equal to the set of vectors w in R” such that w-v € Z forall v € L. 
For example, if L is the standard lattice Z” we see from Lecture 4 that 


ozn (T) = O(0, 7)”. 


Repeating the argument from the beginning of Lecture 4 we obtain that, for any unimod- 
ular lattice L, 

OL(—1/7) = (—ir)"/?6z(r). (7.8) 
Also, if we additionally assume that L is even, i.e. Q(v) € 2Z for any v € L, we 


obviously get 
OLIT + 1) = 61 (T). 


In particular, if 8|n we see that 0z (T) is a modular form with respect to T (1). It is amazing 
that one does not need to assume that n is divisible by 8. It is a fact! Let us prove it. 
Assume n is not divisible by 8. Replacing n by 2n (if n is even)(resp. 4n if n is odd), and 
Lby LỌ L (resp. by L LG LG L), we may assume that n is divisible by 4 but not by 
8. By (7.8) we get 

6,(—1/r) = —1"/?0z(r). 


Since 0z is always periodic with respect to 1, this implies 
Orla ST = =p at = —Or. 


Obviously this contradicts the fact that (ST)? = 1. Now we know that for any even 
unimodular lattice 


br E Myya(P(1)). (7.9) 


Now let n = 8. Since Mə(T(1)) = CE, we see that 0z, is proportional to the 
Eisenstein series Æ4. Comparing the constant coefficients we see that 


br = E4/2Ç(4). 


In particular, for any m > 1, 


rL(m) = 24003(m). (1.10) 
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In fact there exists only one even unimodular lattice in RE (up to equivalence of lattices). 
The lattice is the famous Eg lattice, the root lattice of simple Lie algebra of type Eg. 


Fig.2 
Here the diagram describes a symmetric matrix as follows. All the diagonal elements 
are equal to 2. If we order the vertices, then the entry a;; is equal to —1 or 0 dependent 
on whether the i-th vertex is connected to the j-th vertex or not, respectively. 
Take n = 16. Since M4(T(1)) = CEs, we obtain, by comparing the constant 
coefficients, 


8L = Eg/2¢(8). 


In particular, we have 
rm) = 1607(m)/Ba, (7.11) 


where By is the fourth Bernoulli number (see Lecture 6). There exist two even unimodular 
lattices in R!°. One is Eg Eg. Another is T16 defined by the following graph: 





Now let n = 24. The space iy: aan by A and Ej. We can write 
0L = tn, + cL. 
XU 
This gives 
rem) = on (m) +cerT(m), (7.12) 


where T(m) is the Ramanujan function (the coefficient at q™ in A). Setting m = 1, we 


get 
65520 


— 7.1 
691 ee 


ex =rL(1)— 


Clearly, cr, Æ 0. 

Except obvious examples Eg6 Es Eg or Eg PT6 there are 22 more even unimodular 
lattices of rank 24. One of them is the Leech lattice A. It differs from any other lattice by 
the property that r,(1) = 0. So, 


65520 
rL(m) = -91 





(o11(m) -r(m)), (7.14) 


In particular, we see that 
T(m) = o11(m) mod 691. 


This is one of the numerous congruences satisfied by the Ramanujan function T(m). 
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7.4 Our goal is to prove an analog of Theorem 7.1 for any subgroup of finite index T 
of T'(1). Let I’ C T be two such subgroups. Assume also that I” is normal in I and 
let G = T/T” be the quotient group. The group G acts on M,;(I”) as follows. Take a 
representative g of g € G. Then set, for any f € M;,(I”), 


9-f =fileg. 


Since f|xg’ = f for any g’ € I” this definition does not depend on the choice of a 
representative. 
The following lemma follows from the definition of elements of M;(T). 


Lemma 7.2. Let I’ be a normal subgroup of T and G =T /T". Then 
Mu(P) = MTE = {f E€ Mal"): g: f = f,Yg € G}. 


It follows from this lemma that the algebra M (T) is equal to the subalgebra of M (T”) 
which consists of elements invariant with respect to the action of the group T/T”. Let n be 
the order of the group G = T/T” (recall that we consider only subgroups of finite index 
of ['(1)). For any f € M(I”) we have 


I[(f-G@-f)) =0 
gEG 


since the factor of this product corresponding to 1 is equal to zero. We have 
F +h ftt... +hn=0, (7.15) 


where h; are symmetric polynomials in g - f,g € G. Clearly they are invariant with 
respect to G and hence, by Lemma 7.2, represent elements of M (T). In particular we see 
that for any normal subgroup T of T (1) 


MT) = {0}, k<0. 


In fact, any modular form of negative weight k will satisfy an equation (7.6) where we 
may assume that each coefficient h; is a modular form of weight ik with respect to T (1). 
However no such modular forms exist except zero. If I is not normal we choose a normal 
subgroup of finite index I” of T and apply Lemma 7.2. 


Lemma 7.3. Let B be any commutative algebra over a field F without zero divisors and 
A be a Noetherian subalgebra of B. Assume that each element b € B satisfies a monic 
equation with coefficients in A: 


b abe H.. HHan =O 


(we say in this case that that B is integral over A). Also assume that the field of fractions 
of B is a finite extension of the field of fractions of A. Then B is finitely generated F- 
algebra if and only if A is finitely generated F-algebra. 
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Proof. This fact can be found in any text-book in commutative algebra and its proof will 
be omitted. 














Theorem 7.2. For any subgroup T of finite index of T (1) the algebra M (T) is a finitely 
generated algebra over C. 


Proof. Let I’ be a normal subgroup of finite index in I'(1) which is contained in T. It 
always can be found by taking the intersection of conjugate subgroups g`} -T -g,g € 
T(1). We first apply Lemma 7.3 to the case when B = M(I”), A = M(I(1)). Since 
A = C[T}, To] is finitely generated, B is finitely generated. It follows easily from (7.15) 
that the field of fractions of B is a finite extension of the field of fractions of A of degree 
equal to the order of the group T/T”. Next we apply the same lemma to the case when 
B= M(I"),A = M(L). Then B is finitely generated, hence A is finitely generated. 














Corollary 7.3. The linear spaces Mp (T) are finite-dimensional. 


Proof. Let fi,..., fk be a set of generators of the algebra Mg(T). Writing each f; as a 
linear combination of modular forms of different weights, and then adding to the set of 
generators all the summands, we may assume that M;(I) is generated by finitely many 
modular forms f; E€ Mk (T), i = 1,...,n. Now M;,(L) is spanned as a vector space 
over C by the monomials gr an fin where kiii +... + inkn = k. The number of such 
monomials is finite. It is equal to the coefficient at t* of the Taylor expansion of the 
rational function 3 


1 
— thi)’ 
Urm 














In the next lecture we shall give an explicit formula for the dimension of the spaces 


M;(D). 


Exercises 


7.1 Find a fundamental domain for the principal congruence subgroup T (2) of level 2. 
7.2 Using Exercise 7.1 find and prove an analog of Lemma 7.1 for the case F = T (2). 
7.3 Let n = 8k. Consider the subgroup I'n of R” generated by vectors v = (a1,...,@n) 
with a; € Zand a, +...+ an € 2Z and the vector (5,---55): 

(i) Show that I, is an even unimodular lattice in R”. 


(ii) Show that I's is isomorphic to the lattice Eg defined in the lecture. 
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(iii) Show that 16 can be defined by the graph from Fig.3 
(iv) Show that T46 is not isomorphic to l'8 6 T's. 


(v) Compute the number of points (x1,..., £8) € RE such that 2x; € 2Z, £i — zj E 
Z,t1 +... + £g EZ, r? +... +r? = 2N, where N = 1,2. 


7.4 Let L C R” be an integral lattice not necessary unimodular. Using the Poisson 
formula from Lecture 4 show that 





1 Tinjq 1 
sp = GeO 


where L* is the dual lattice defined by L* = {v € R” : v- x € Z forall z € L} and Dz 
is the discriminant of L defined by Dz = #L*/L. 


7.5 Let C be a linear subspace of F} (a linear binary code). Let Lo = gar (EC ), where 
r is the natural homomorphism Z” — F3. 
(i) Show that Lc is an integral lattice if and only if for any x = (€1,...,€n) € C the 


number wt(x) = #{i : e; # 0} (called the weight of x) is divisible by 4. In this 
case we say that C is a doubly even linear code. 


(ii) Show that the discriminant of the lattice Lc is equal to 2"~?*, where k = dim C. 
q 


(iii) Let C+ = {y € F} : x -y = 0,Vx € C}. Show that Lc is integral if and only if 
GRA O 


(iv) Assume C is doubly even. Show that Lo. = L@. In particular, Lo is a unimodular 
even lattice if and only if C = C+ (in this case C is called a self-dual code). 


(v) Let C C F$ be a self-dual doubly even code. Show that n must be divisible by 8. 
7.6 Let A(T) = 0(0;7), B(T) = 91 9(0; T). 


(i) Show that 


1 


clic 
(A(-1/7), B(-1/7)) = (F)? (A(T), B(7)) - GE 4) : 
v2 v2 


(ii) Show that the expression A*B*(A* — B*)* is a modular form of weight 6 with 
respect to T (1). 


(iii) Show that A*B*(A* — B*)* = 16A(r). 
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(iv) Show that AS + 1444B4 + BS = 547 E4 (T). 


7.7 Let C C F} bea linear code. Define its weight enumerator polynomial by 


WolX, Y) = >b xr-wt(z)ywt(e) = I AXT, 
xrEC i=0 


where A; is the number of x € C with wt(x) = i. 


(i) Show that 
940 = Wc(A, B). 


(ii) Prove MacWilliams’s Identity: 


1 

Woi (X,Y) = zama Wo(X +Y,X - Y). 

Gii) Using Theorem 7.1 show that for any self-dual doubly even code the enumerator 
polynomial Wc (X, Y) can be written as a polynomial in X’ = X8+14XtY +Y °$ 
and Y’ = X4Y4(X4 — Y*)4 (Gleason’ Theorem). where A, B are defined in the 
previous problem. 


(iv) Deduce from (iii) that the enumerator polynomial Wc(X, Y) of any doubly even 
self-dual linear code is a symmetric polynomial in X,Y (i.e. Wo(X,Y) = Wo(Y, X)). 
Give it an independent proof using only the definition of We (X,Y). 


7.8 Let C be a self-dual doubly even linear code in F34 and 6z6 (T) = D> rro(m)q™ be 
the theta function of the even unimodular lattice Lc associated to it and Wo(X,Y) = 
Y A; X*Y 4 be its weight enumerator polynomial. 


(i) Show that 


Tho (2) =48 +1644, rr, (4) = 22 Ag + 640.Aq + 1104. 
(ii) Using (7.13) show that Ag = 759 — 444. 


7.9 Let A = P_n be a commutative graded algebra over a field F. Assume A has 
no zero divisors, Ag = F - 1 and dim Ay > 1 for some nN > 0. Show that An = 0 for 


n < 0. Apply this to give another proof that M(T) = 0 for k < 0. 


7.10 Find an explicit linear relation between the modular forms E16, E? and E4E10, 
where Esg denotes the Eisenstein series. Translate this relation into a relation between 
the values of the functions oa(m). 


7.11 Let f(T) be a parabolic modular form of weight k with respect to (1). 
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(i) Show that the function ¢(7) = |f(r)|(Im 7)* is invariant with respect to (1). 
(ii) Show that ¢(7) is bounded on H (it is not true if f is not cuspidal). 


(iii) Show that the coefficient an in the Fourier expansion f(T) = ` anq” can be 
computed as the integral 


1 
an = J f(z + iy)e 27+) dy, 
0 


(iv) Using (iii) prove that |an| = O(n") (Hecke’s Theorem). 


7.12 Let L be an even unimodular lattice in R°* and rz (m) be defined as in Example 7.2. 
Using the previous exercise show that 
8k 


rL(m) = Boy 1”) + O(m**), 


7.13 Let L = Eg © Eg @ Eg. Show that 


1 
p + 232000 a 
C(12) 691 





6, = 
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Lecture 8 


The Modular Curve 


8.1 In this lecture we shall give an explicit formula for the dimension of the spaces 
M;(T), where I is any subgroup of finite index in SL(2, Z). For this we have to apply 
some techinique from algebraic geometry. We shall start with equipping H*/T with a 
structure of a compact Riemann surface. 

Let T be a subgroup of SL(2, R). We say that T is a discrete subgroup if the usual 
topology in SL(2, R) (considered as a subset of R4) induces a discrete topology in I’. The 
latter means that any point of I’ is an open subset in the induced topology. Obviously 
SL(2,Z) is a discrete subgroup of SL(2,R). We shall consider the natural action of 
SL(2, R) on the upper half-plane H by Moebius transformations. 


Lemma 8.1. Any discrete subgroup T of SL(2, R) acts on H properly discontinuously. 


Proof. Observe that the group SL(2, R) acts transitively on H (view the latter as a subset 
of R? of vectors with positive second coordinate). For any point z € H the stabilizer 
group is conjugate to the stabilizer of say z = i. The latter consists of matrices (2 b) E€ 
SL(2, R) such that a = d, b = —c. It follows that this group is diffeomorphic to the circle 
{(a,b) € R? : a? +b? = 1}. This shows that the map f : SL(2,R) — H defined by 
f(g) = g- iis diffeomorphic to a circle fibration over H. This easily implies that pre- 
image of a compact set is compact. Let A, B be two compact subsets in H. We have to 
check that X = {g €T : g(A) A B # Ø} is finite. Clearly, g(A) O B F 0 if and only if 
gg’ = g" for some g’ € f—'(A), g” € f-1(B). Since A! = f—'(A) and B’ = f-1(B) 
are compact subsets of the group SL(2, R) the set B’ - A’~! is also compact. In fact, 
this set is the image of the compact subset B’ x A’ of SL(2, R) x SL(2,R) under the 
continuous map (g’, g”) > g'g’~'. Thus X is equal to the intersection of the discrete 
subset I with a compact subset of SL(2, R), hence it is a finite set. 














Applying the previous Lemma and Theorem 2.2 we obtain that H/T has a structure 
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of a Riemann surface and the canonical map 
nr: H —> H/T (8.1) 


is a holomorphic map. 


Example 8.1. Let T =T (1). Let us show that there exists a holomorphic isomorphism 
H/SL(2, Z) S C. 


This shows that the set of isomorphism classes of elliptic curves has a natural structure of 
a complex manifold of dimension 1 isomorphic to the complex plane C. Since g3 and A 
are of the same weight, the map 


HPC) 7 (g2(r)’, A(z) 


is a well defined holomorphic map. Obviously it is constant on any orbit of T (1), hence 
factors through a holomorphic map 


f H/T) > P(C). 


Since A does not vanish on H, its image is contained in P!(C) \ {oo} = C. I claim that f 
is one-to-one onto C. In fact, for any complex number c the modular form f = g3 — cA is 
of weight 6. It follows from Lemma 7.1 that f has either one simple zero, or one zero of 
multiplicity 2 at the elliptic point of order 2, or a triple zero at the elliptic point of index 
3. This shows that each c € Z occurs in the image of j on H/T and only once. 

We leave to the reader the simple check that a bijective map between two complex 
manifolds of dimension 1 is an isomorphism. 


Notice that the explicit isomorphism H/SL(2, Z) — C is given by the holomorphic 
function T — g3/(g3 — 279%). The function 

172893 1728(27)!%93 

j ( T) = 2 ( ) 2 


— E (8.2) 
g — 2792 A 





is called the absolute invariant. The constant factor 1728 = 12° is inserted here to nor- 
malize the coefficient at q7} for the Fourier expansion of j at oo: 


co 
jr) =q} +744 + 5 Cng, q= e°". (8.3) 
n=1 
We have proved that 
E, S By <=> j(T) = j(T'). (8.4) 








The coefficients cn in (8.3) have been computed for n < 100. The first three are 
cı = 196884, cy = 21493760, c3 = 864299970. 


They are all positive and equal to the dimensions of linear representations of the Griess- 
Fisher finite simple group (also called the Monster group). 


95 


8.2 The Riemann surface H/T is not compact. To compactify it we shall define a com- 
plex structure on 
H*/T = H/T U {cusps}. (8.5) 


First we make H* a topological space. We define a basis of open neighborhoods of œo as 
the set of open sets of the form 


U: = {r E€ H : Im? > c}U {oo}, (8.6) 


where c is a positive real number. Since SL(2, Z) acts transitively on H* \ H we can take 
for a basis of open neighborhoods of each x € Q the set of g-translates of the sets Ue for 
all c > 0 and all g € SL(2, Z) such that g - oo = x. Each g(U-) is equal to the union of 
the point x and the interior the disk of radius r = T touching the real line at the point 


x. In fact, if g = (° at we have x = a/y and 








ò 
I 
gU.) ={r €H:Img!-r>ch={reH: mT 5 > 9}= 
|—yr+al (8.7) 
{r=a+iy:(e@— [+y . P< : } . 
l y 272c dyte? 


Now the topology on H*/T is defined as the usual quotient topology: an open set in H*/T 
is open if and only if its pre-image in H is open. Since |y| > 1 in (8.7) unless g € Tx, 
we can find a sufficiently large c such that 


Pæ = {g ET : g(Ue) O Ue # Ø}. 
Now, if x = gı - co we deduce from this that 


Pe = gil ogy = {g ET : g(91(Uc)) A g1 (Ue) £ 0}. (8.8) 


This shows that the pre-image of some open neighborhood of a cusp on H*/T is equal to 
the disjoint sum of open neighborhoods of the representatives of this cusp. 


Theorem 8.1. Let T be a subgroup of finite index in SL(2, Z). The topological space 
H*/T admits a unique structure of a compact complex manifold of dimension 1 such that 
H/T is an open submanifold. 


Proof. To warm up let us first see this in the case F = SL(2,Z). We saw in Example 
1 that H/T(1) = C. The complex plane C admits a natural compactification. It is the 
Riemann sphere P!(C) = CU {oo}. The point oo represents the unique cusp of I'(1). 
Thus we see that 

H*/T = PC). (8.9) 
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Now let us consider the general case. The canonical holomorphic map mpa) : H > 
H/T (1) is equal to the composition of the holomorphic maps mp : H — H/T and 
Tryra) : H/T + H/T (1). It extends to the composition of continuous maps 


wha HY D H/T TAO ar D(1) = P(C). 


First we see that the orbit space H* /T is a Hausdorff topological space. This is obviously 
true in the case I = ['(1). Since H/T is Hausdorff, we can separate any two points which 
are not cusps. Since we can separate oo on H*/T (1) from any finite point, we can separate 
any pre-image of oo in H*/T(1), which is a cusp on H*/T(1), from a point on H/T. 
Finally we can separate any two cusps in H/T since the pre-image ™a) (U) of an open 
neighborhood U of oo € H*/T(1) is equal to the disjoint union of open neighborhoods 
of points in H* \ H = P! (Q). The pre-image 74(V(c)) of an open neighborhood V (c) of 
a cusp c =T -x € H*/T is the disjoint union of open neighborhoods of points belonging 
to the orbit IT - x. Obviously for two different T-orbits ¢ and c’ these sets are disjoint. 
Thus the open sets V(c) and V(c’) are disjoint. Let U = gı (Uc) be a neighborhood of a 
representative x = gı - co of some cusp c of I. The natural inclusion U — H* factors 
through the map U/T, — H*/T. Taking c small enough and using (8.8) we see that 
this map is injective. Its image is an open neighborhood U of the cusp c € H*/T. Let h 





be the index of the cusp. Then I’, consists of matrices + T ag and hence the map 
T — e2t7/P sends U /Tz into C with the image isomorphic to an open disk. This defines 


a natural complex structure on the neighborhood U. Notice that it is consistent with the 
complex structure on UN H/T = U \ {c}. Also it is easy to see that the map ™ 1) extends 
to the composition of holomorphic maps. 

It remains to prove the last assertion, the compactness of H*/T. First of all, we 
replace I by a subgroup of finite index I’ which is normal in T (1). Then 


WT = (H PYT H/T) = (H/T) /(P/T()) 














It remains to use the following simple fact from topology: 


Lemma 8.2. Let G be a finite group acting continuously on a topological space X. Then 
X is compact if and only if X/G is compact. 


Proof. Consider the projection 7 : X + X/G = Y. It is a surjective map. It is obvious 
that the image of a compact space is compact. Assume that Y is compact. Take an open 
cover {U;} of X. Then replacing U; with Ugeag(U;) we may assume that each U; is 
G-invariant. Since U; = 7~'(m(U;)) the sets 7(U;) are open in Y. Since Y is compact 
we can find a finite subcover of {7(U;)}. This will give us a finite subcover of {U;}. 
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Remark 8.1. The assertion of the previous theorem does not extend to any discrete sub- 
group of SL(2,R). For example, if we take T = {1}, the space H* = H*/{1} does 
not have any complex structure. In fact, any open neighborhood U of oo, after deleting 
oo, must be isomorphic to the punctured open unit disk {z € C : 0 < |z| < 1}. The 
latter space is not simply-connected (its fundamental group is isomorphic to Z). However 
U \ {co} can be always chosen to be equal to Im 7r > c which is simply-connected. How- 
ever, there is a large class of discrete subgroups of SL(2, R), including subgroups of finite 
index in SL(2, Z), for which the assertion of the theorem remains true. These groups are 
called fuchsian groups of the first kind. 


Definition. The compact Riemann surface H*/T is called the modular curve associated 
to the subgroup T of SL(2, Z) and is denoted by X (T). 


8.3 Now let us discuss some generalities from the theory of compact Riemann surfaces. 
Let X be a connected compact Riemann surface and f be a meromorphic function on X. 
This means that the restriction of f to any open neighborhood U is equal to the quotient 
of two holomorphic functions on U. Assume f Æ 0. For each point x € X we can define 
the order v,(f) of f at x as follows. First we identify a small neighborhood U of x with 
a small neighborhood V of 0 in C. Then f is equal to the pre-image of a meromorphic 
function on V which admits a Laurent expansion anz” + an412”t! +... with an 4 0 for 
some integer n. We set 


Vet) = m: 


It is easy to see that this definition does not depend on the choice of an isomorphism 
between U and V. When v,(f) > 0 (resp. vz(f) < 0) we say that vz (f) is the order of 
zero (resp. the order of pole) of f at x. We have the following easily verified properties 


of vz(f): 


Lemma 8.3. Let x € X and f, g be two meromorphic functions on X. Then 
(i) Val fg) = Vel f) + ve(g); 
(ii) ve(f +g) = min{r2(f), ve} if f +g 40. 


A meromorphic function on X is called a local parameter at x if v,(f) = 1. Lemma 
8.3 (i) allows us to give an equivalent definition of v,.(f). It is an integer such that for any 
local parameter ¢ at x, there exists an open neighborhood U in which 


f — PPE 


for some invertible function € € O(U). 


Let Div(X) be the free abelian group generated by the set X. Its elements are called 
divisors. One may view a divisor as a function D : X — Z with finite support. It can be 
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written as formal finite linear combinations D = X` a,x, where az = D(x) € Z,x € X. 
For any D = X` ayx € Div(X) we define its degree by the formula: 


deg(D) = X` az. (8.10) 


There is an obvious order in Div(X) defined by choosing positive elements defined 
by positive valued divisors. We say D > 0 if D is positive or equal to 0. 
For any nonzero meromorphic function f we define the divisor of the function f by 


div(f) = J ve(f)«. (8.11) 


Here we use the compactness of X to see that this sum is finite. Using Lemma 8.3, we 
see that divisors of functions (principal divisors) form a subgroup P(X) of Div( X). Two 
divisors from the same coset are called linearly equivalent. The group Div(X)/P(X) is 
called the group of classes of divisors. 

Finally we introduce the space 


L(D) ={f E€ M(X)*: (f) + D> Oo}. (8.12) 


The famous Riemann-Roch theorem provides a formula for the dimension of this space. 
In order to state it we need two more ingredients in this formula. The first one is the notion 
of the canonical class of divisors. 


Definition. Let U be an open subset of a Riemann surface X and t : U + Cis a holomor- 
phic function defining an isomorphism from U to an open subset of C. A meromorphic 
differential on U is an expression w of the form 


w= f(t)dt, 


where f(t) is a meromorphic function on U. A meromorphic differential on X is a col- 
lection w = {f(ty)dtyu} of differentials on open subsets U as above which cover X. It 
must satify the following compatibility property: if two open sets U and U’ overlap then 








dty: 

p 

fu = fu ity 

when restricted to U N U’. Here “t is the derivative of the function gu’ = tyro i 


dtu 
tu(UNU') sty (UNU’). 


Two meromorphic differentials are said be equal if they coincide when restricted to 
the subcover formed by intersections of their defining covers. 
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Let w = { f(ty)dtu } be a meromorphic function on X. Define 
Vve(w) = Ve( fu). (8.13) 


Since the function ae is invertible at x, we see that this definition is independent of the 


choice of an open neighborhood U of x. The divisor 
div(w) = > Vz (Ww) a. (8.14) 
x 


is called the divisor of the meromorphic differential w 
Since X is compact and hence can be covered by a finite set of locally compact sub- 
sets, we see that div(w) is well-defined. 


Lemma 8.4. Let w and w be two meromorphic differentials on X. Then their divisors 
div(w) and div(w’) are linearly equivalent. 


Proof. Without loss of generality we may assume that w and w” are defined on the same 
open cover and use the same local parameter functions ty. If w = fydty andw’ = fyrdty 
then the collection of meromorphic functions fy / fọ define a meromorphic function F on 
the whole X (since fu / fu = fi / ft, for any two overlapping open subsets in the cover). 
It follows from the definition that 


div(w) = div(w’) + div(F). 











This proves the assertion. 





Definition. The class of linear equivalence of the divisor div(w) of a meromorphic dif- 
ferential is called the canonical class of X and is denoted by Kx. 


8.4 We can state (without proof) the following: 
Theorem 8.2. (Riemann-(Roch) For any divisor on X, 
dim L(D) = deg(D) + dim L(Kx — D)+1-—g 


for some non-negative integer g, called the genus of X. 


Note that the space L(D) depends only on the linear equivalence class of D. In fact, 
if D' = D + div(f), then the map g — gf establishes a bijective linear map from L(D’) 
onto L(D). We use this remark to explain the notation L(x — D) (where Kx is not 
a divisor but rather a class of divisors). This remark, together with the Riemann-Roch 
formula proves the following: 
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Corollary 8.1. Linearly equivalent divisors have the same degree. In particular, for every 
non-zero meromorphic function f on X, 

deg(div(f)) = 0. (8.15) 


Proof. Replacing D with D + div(f), we do not change the dimensions of the spaces 
L(D) and L(K x — D) but change deg(D) by deg(D + div(f)) = deg D + deg(div(f)). 
It follows from Riemann-Roch that deg(div(f)) = 0. 














Corollary 8.2. 
deg Kx = 2g — 2. 


Proof. Take D = 0 and use that L(0) = O(X) = C. Here we use that a holomorphic 
function on compact Riemannian surface is constant. This gives 


g = dim L(Kx). (8.16) 











Now take D = Kx and get deg(Kx) = 2g — 2. 





Theorem 8.3. Let b;(X) = dim H;(X,R) be the Betti numbers of X. Then 
bı = 2g, bo = b2 = 1. 


Proof. Since X is a connected compact manifold of dimension 2, this is equivalent to 


(—1)'b;(X) = 2 — 2g = — deg(K x). (8.17) 


% 
I 


Let f be a non-constant meromorphic function on X (its existence follows from the 
Riemann-Roch theorem). It defines a holomorphic map f : X — P!(C). For any point 
x EX set 


(8.18) 
-ve(f) if F(z) = 00. 


It is a positive integer. Since deg(div(f — z)) = 0 we obtain 


5 a= 3 a (8.19) 


ze f—1(z) x€ f—t (co) 


ex(f) = e eee an 


Notice that, for any x € X, 


ex(f)-1 if f(z) 4.00 


(8.20) 
—ex(f)—1 if f(a) = ov. 


Vz(df) = 


101 


Here df is the meromorphic differential defined locally by of dt, where t is a local param- 
eter at x. Since the degree of df is finite we obtain that there are only finitely many 
points x € X such that e;(f) > 1. In particular, there is a finite subset of points 
S ={y1,...,ys} in P!(C) such that, for any y £ S 


> elf) =n = #f7'(y). (8.21) 
a: f(x)=y 


Taking into account the formulas (??)-(8.21), we obtain 


29-2= X ve(div(df))= X> (ex(f)-1)+ SY (-es(f)-1) = 


rex y:f (y)#oo yf (y)=o0 
X (ec(f) -1)-2 > = X (ex(f) -— 1) - 2n = (8.22) 
zrEX f (x)=00 zEX 
X(n- #f7"(y)) - 2n. 
yeY 


This is called the Hurwitz formula. The number n here is called the degree of the mero- 
morphic function f. Formula (8.21) says that this number is equal to # f~t! (y) for almost 
all y € P(C). 

We shall define the triangulation of X as follows. Take a triangulation T of P!(C) 
in which each point y; is a vertex. Consider the pre-image 7’ of this triangulation in X. 
Since, the restriction of f to P'(C)\.9 is a covering map, the open cell of our triangulation 
are equal to connected components of the pre-images of open cells of the triangulation of 
the sphere. Let do, d1, d2 be the number of 0-,1-, and 2-cells 7. Then we have nd, 1- and 
ndə 2-cells in T”. We also have X es #f ~1(y) 0-cells in 7”. By the Euler formula we 
have 


eX) = SO # f(y) - ndı + ndz = 


yes: 


XO #f ~ (u) + n(e(P(C)) — n#S = 2n — SO(n — #F*(y)). 


yes yes 











Comparing this with (8.22) we obtain the assertion of the Theorem. 





Example 8.2. Let X = P! (C). Take w = dz on the complement of oo and w = —z?d 
on the complement to 0. Then div(w) = —2o00. Hence deg(K'x ) = —2. This shows that 
g = 0 for the Riemann sphere. Of course this agrees with the topological definition of the 
genus. 


Example 8.3. Let X = E- be a complex torus. The holomorphic differential form w = dz 
on C is invariant with respect to translations. Hence it descends to a 1-differential on X. 
Obviously its divisor is zero. Thus deg(Kx) = 0 and the genus equals 1. Again this 
agrees with the topological definition. 
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8.5 Let us compute the genus of the Riemann surface X = H*/T. Consider the mero- 
morphic function j(7). Since it is a meromorphic modular form of weight 0 with respect 
to ['(1) it is a also a meromorphic modular form of weight 0 with respect to r. Hence 
it can be considered as a meromorphic function on X. Let m : X — H*/T(1) be the 
canonical projection. Since j, considered as a function on H*/T(1) has a unique sim- 
ple pole at oo, we may identify j with the pull-back 7*(z) of the coordinate function 
z on P!(C). We use the Hurwitz formula (8.22) from the proof of Theorem 8.4. Let 
e=T-7TE€X. ifr gT(1)-iUTrT(1)-pUT(1) - ~, then x has an open neighborhood 
holomorphically isomorphic to an open neighborhood of 7 and an open neighborhood of 
n(x). Since j — j(z) = a*(z — j(x)), we see that ez(j) = 1. If r € T(1) - i, and 
I’, = {1}, then x has an open neighborhood isomorphic to an open neighborhood U of 
T but j(x) = j(7) has an open neighborhood isomorphic to U/T(1)-. This shows that 
j — j(x) = n*(z — j(x)) vanishes at x with order 2, i.e. ez(j) = 2. If r € T'(1) - i, but 
T- Æ {1}, then x has an open neighborhood isomorphic to an open neighborhood U of 
j(x), hence e,(f) = 1. Similarly we find that e,(f) = 3if7 € [(1)-pandT, = {1} 
and ez(j) = lif r € T(1)- pandT, Æ {1}. Finally, if x is a cusp of index h, then x has 
an open neighborhood U isomorphic to U./(T"), where U. = {r : Im > c} U oo, and 
j(x) = oo has an open neighborhood V isomorphic to U./(T’). The restriction of 7 to U 
is given by sending a local parameter in V to the h-th power of a local parameter in U. 
Since 1/z is a local parameter co, j has a pole at x of order h. This shows that v,(j) = h 
and hence e,(7) = h. 

To collect everything together and state a formula for the genus of X, let us make the 
following: 


Definition. Let X = H*/T. A point x =T -7 is called an elliptic point of order 2 (resp, 
of order 3) if r € T(1) - i (resp. 7 € P(1)- p) and T; A 1. 


Theorem 8.4. The genus of H* /T is equal to 


HT T2 T3 Too 
12 4 3 2? 


where ur is the index of T/T A (£1) in T(1)/(41), r2 is the number of elliptic points of 
I of order 2, r3 is the number of elliptic points of I of order 3, and ræ is the number of 
cusps of I. 


g=1+ 











Proof. Notice first that the number ur is equal to the degree of the meromorphic function 
X(T) —> X(T(1)) S P(C) defined by the j-function j : H — C. In fact, the number 
of the points in the pre-image of a general z € C is equal to the number of [’-orbits in H 
contained in a '(1)-orbit. Applying (8.22), we have 


2g — 2 = —2u + X (es(j) — 1) = 
rEX 
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—2u+ 5 (ex(j)— 1) + `> (es(i) — 1) + (ez(j)— 1). 


j(x)=J (0) j(2)=J(p) j(x)=00 


We have (u — r2)/2 points over j(i) with ez(j) = 2 and (u — r3)/3 points over j (p) with 
ex(j) = 3. Also by (8.21), the sum of indices of cusps is equal to u. This gives 





2g — 2 = —2u + (u — r2)/2 + 2(u — 13) /3 + (u — ræ), 


hence 
H r2 T3 Too 


12 4 3 Or 

















We shall concentrate on the special subgroups T of T (1) introduced earlier. They are 
the principal congruence subgroup r (N) of level N and 


To(N) = iC 4 € SL(2,Z) : NJ}. 


Obviously 
P(N) CTo(N) 
Lemma 8.5. 
HN (= HT(N) = f piv if N =2 (8.23) 
Hon = Hron) = (P(A) :To(N)] = N [0 +, 
p|N 


where p denotes a prime number. 


Proof. This easily follows from considering the action of the group SL(2, Z/N) on the 
set (Z/N)?. The isotropy subgroup of the vector (1,0) is isomorphic to the group of 
To(N)/T(N) c SL(2, Z/N). It consists of matrices of the form (5 fe) . The number 
of invertible elements a in the ring Z/N is equal to the value of the Euler function (N). 
The number of elements b is N. This gives the index of T(N) in [o(V). The index of 
To(N) in T(1) is equal to the number of elements in the orbit of (1,0). It is the set of 
pairs (a,b) € Z/N which are coprime modulo N. This is easy to compute. 














Lemma 8.6. There are no elliptic points for T(N) if N > 1. The number of cusps is 
equal to uy /N. Each of them is of order N. 


104 LECTURE 8. THE MODULAR CURVE 





Proof. The subgroup T = T(N) is normal in T'(1). If T- 4 {+1}, then g'g~! = T; for 
any g € I'(1) which sends 7 to į. Similarly for elliptic points of order 3 we get a subgroup 
of T fixing 2/3. It is easy to see that only the matrices 1 or —1, if N = 2, from T(N) 
satisfy this property. We leave to the reader to prove the assertion about the cusps. 














Next computation will be given without proof. The reader is referred to [Shimura]. 


Lemma 8.7. The number of elliptic points and cusps for the group To(N) is given by the 
following formula: 


(i) 
mat) -1 pe (8.24) 
Lan + (> )) otherwise. 
(ii) 
=) oe (8.25) 
ES pw F (=) otherwise. á 
(iii) 


ro= > (4,2). 


d|N,d>0 


Here ¢ is the Euler function and (-) is the Legendre symbol of quadratic residue. We 


have : 
24 0 ifp=2, 
(—)=41 if p = 1 mod 4, (8.26) 
P —1 ifp=3mod4, 
ifp = 3, 
= =41 ifp=1mod3, (8.27) 


—1 ifp=2mod3. 


Applying the previous lemmas we obtain 


Corollary 8.3. The genus gy of the Riemann surface X(N) = H*/T(N) is given by the 
formula 


1 4 EN(V—6) FN >1, 
w= 1)" 12N: if (8.28) 


ifN =1. 
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Here we use that —I ¢ T(N) for N > 1. We know that the Riemann surface X (1) = 


H* /T(1) parametrizes isomorphism clases of elliptic curves. 


For any elliptic curve Æ we denote by pE the subgroup of N-torsion points. If 
E = C/A we have 


1 


Theorem 8.5. There is a natural bijective map between the set of points of X(NY = 
X(N) \ {cusps} and isomorhism classes of pairs (E, ġ), where E is an elliptic curve 
and ọ : (Z/N)? + nE is an isomorphism of groups. Two pairs (E, p) and (E',¢') are 
called isomorphic if there exists an isomorphism f : E — E' of elliptic curves such that 


fop= g. 


Proof. Let E = C/A. Then yE = A/A. An isomorphism ¢ : (Z/N)? > nE is 
defined by a choice of a basis in yE. A representative of a basis is an ordered pair of 
vectors (a, b) from A such that (Na, Nb) is a basis of A. Replacing E by an isomorphic 
curve, we may assume that A = Z + TZ for some T € H and (Na, Nb) = (1,7). This 
defines a surjective map from H to the set of isomorphism classes of pairs (E, 6). Assume 
the pair (E+, ($, Ẹ)) is isomorphic to the pair (E~, (x, =) Since Æ’ is isomorphic to 


E, we get T' = ST for some M = 4 





€ Tr (1). The corresponding isomorphism 


is induced by the isomorphism of C, z —> z(y7 + ô). It sends 1/N to (yr + 6)/N and 
T'/N to (at + 8)/N. It is easy to see that 


(at+ 6)/N=7T/N moduloA <=> a=1 moduloN, 8=0 modulo N 





(yr +6)/N=1/N moduloA <= ô=1 moduloN, y=0 modulo N. 


This shows that 7 and 7’ define isomorphic pairs (E+, ¢), (Er, ¢’) if and only if they 
differ by an element of T(N). 














Remark 8.2. Since T(N) is an invariant subgroup of T (1) the factor group 

Tr(1)/T(N) = SL, Z/N) acts naturally on X(N) and the orbit space is isomorphic to 
X (1). If one uses the interpretation of H/T (N) given in the theorem, then it is easy to 
see that the action of an element o € SL(2, Z/N) is defined by sending the isomorphism 
class of a pair (E, @) to the isomorphism class of the pair (E, ø o ). 


Theorem 8.6. There is a natural bijective map between the set of points of Xo(N)! = 
Xo(N) \ {cusps} and isomorhism classes of pairs (E, H), where E is an elliptic curve 
and H is a cyclic subgroup of order N of yE Two pairs (E, H) and (E’, H’) are called 
isomorphic if there exists an isomorphism f : E — F' of elliptic curves such that f(H) = 
H’. 
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Proof. It is similar to the previous proof and is left to the reader. 


Remark 8.3. There is a natural interpretation of the cusp points as the isomorphism classes 
of certain degenerate pairs (Æ, H) but to explain this is beyond of the scope of these 
lectures. 


8.6 Finally we interpret the spaces M;(I) as the spaces L(D) for some D on the Rie- 
mann surface X(T). To state it in a convenient form let us generalize divisors to admit 
rational coefficients. We define a Q-divisor as a function D : X — Q with a finite sup- 
port. We continue to write D as a formal linear combination D = )° a,x of points x € X 
with rational coefficients a. The set of Q-divisors form an abelian group which we shall 
denote by Div(X)g. For any x € Q we denote by |x] the largest integer less or equal 
than x. For any Q-divisor D = -` agx we set 


[Pl = S ae le: 


Theorem 8.7. Let 


r3 


IR 2 = P= 
ee Paces y tit DG Diag De 


i=r1+1 
where x£1,..., £r are elliptic points of order 2, £r1+1, .- . - , £rı2+r are elliptic points of 
order 3 and c1, .. . , T% are cusps. There is a canonical isomorphism of vector spaces 


My(E) © L(kKx + [kD])), Mal)? = L(kKx + |kD°]). 
Proof. Let F € Mp(T). We define its divisor 


div(F) = $ ve(F)z € Div(X)a, 


xEX 
by setting 
ty,(F) if =[- 7 isan elliptic point of order e, 
(= ) if cis a representative of a cusp x. 


Vel F 
v:(F) if x =T - 7 is neither an elliptic point nor a cusp. 


Here v, (F) = n, where an(z — T)” + .. . ,an # 0 is the the Taylor expansion of F at 7. 
Similarly, ve( F`) is the smallest non-zero power of q = e?7‘/" which occurs in the Fourier 
expansion of F at the cusp c of order h. 
Consider the j-function j : H — P!(C) as a meromorphic T-invariant function on H. 
Its derivative satifies 
d .att+ 


i AE A 


aTt+ 6. d 
yt +ô’ dr 


at+ p 
yr +6 


= araar SED, 








( 
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This shows that ®(r) = j’(r)* satisfies 





So if we consider the ratio F'(7)/®(7) we obtain a T-invariant meromorphic function on 
H. Obviously ® is meromorphic at the cusps. So this function descends to a meromorphic 
function on X. Let us compute its divisor. Let x = I -r € X and t be a local parameter 
at x. We know that v, (TẸ (t)) = e(x) where e(x) = 1,2 or 3 dependent on whether x is 
not an elliptic point, an elliptic point of order 2, or an elliptic point of order 3. Thus 


_ vr(F) — v,(®) 


= (F) — v,(®). 
Let us compute v,(®). We know that 


2 ifieT(1)7, 
vlj- jT) = 43 ife ET(1)r, 


1 otherwise. 


This immediately implies that 


1 ificr(i)-r, 
v(j)=42 af ere Pie; 
0 otherwise. 


Thus 
Vz(®) = k(es(j) — 1)/e(2). 


Now, let x = c; be a cusp represented by c € P!(Q). We used the local parameter 
e?TiT/h to define ve( F). Since j admits the Fourier expansion e~ 277 +744-+c,e277 +... 
at oo, we see that j’ has the expansion —27ie~ 27" + co2rie?™'* +... at the cusp c. This 
shows that v.(®) = —kh. So we get 





div(F'/®) = div(F) Bye y Dy ) V hici. 
z i=1 


e(x 


Comparing this with the computation of div(dj) in the proof of Theorem 8.5, we get 


div(F’) = div(F'/®) + kdiv(dj) +k 5 (1—e(x) t)r + k Se 
i=1 


elliptic x 
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Since div(F) > 0 we obtain that F/® € L(D‘), where D’ is linearly equivalent to 
kKx + |kD] as in the assertion of the theorem. Conversely, if Y € L(D’) we easily get 
that F = Y € M, (TL). Finally, if F is a cuspidal modular form, we have vs(F) > 0 




















at cusps. This easily implies that F/® € L(D’ — cı — ... — Cr). This proves the 
theorem. 
Corollary 8.4. 
2k — 1)(g — 1) + kro k/2 2k/3] ifk>1, 
dim My(t) = [CE7 DU- D + kroo +ralk/2] +r3|2k/3] ifk > 
g+ræ-— 1 ifk=1. 
2k—1 1) 4 Lroo 4 2 2k j 1, 
dim matr? = [CF DO- 1+ (E 1)roo +ralk/2] +ral2k/3] ik > 
g ifk=1. 
Proof. This follows immediately from the Riemann-Roch theorem (since 
deg(kKx + |kD]|) > deg Kx, the space L(Kx — (kKx + |kD])) = {0}). O 


Corollary 8.5. Let fo,..., fy be a basis of the space Mg(T). Then the map 
f:H>PN(C), rE o aae) 
defines an isomorphism from X (T) onto a projective algebraic curve in PN (C). 


Proof. We know this already when T = T(1). So we may assume that wp > 1. By 
Theorem 8.5 we can identify the space M6(I‘) with L(D), where 


deg D = 6 deg Kx + 6ræ + 3r2 + 4r3 = 12g — 12 + Gra + 3r2 + 4r3. 


I claim that deg D > 2g + 1. If g > 0 this is obvious. If g = 0 we use the formula for the 
genus from Theorem 8.4. It easily gives that 


—12 + 6ræ + 3r2 + 4r3 =p > 2g +1 = 1. 
It follows from the proof of Theorem 8.8 that 
Valfi) = val fi/F'®) + D(z). (8.28) 


Now we use the standard argument from the theory of algebraic curves. First of all the map 
is well-defined. In fact, if all functions f; vanish at the same point x, we obtain vz (fi) > 0 
for alli = 0,...,.N, and hence v,(f;/j’°) +D(a)—1 > 0fori = 0,..., N. This implies 
that L(D) = L(D — x). However, this contradicts the Riemann-Roch theorem: since 


deg(Kx — D) < deg(Kx — (D — x)) = 2g — 2 — deg D+1<0, 
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it gives dim L(D) = deg D+ 1 — g > dim L(D — x) = deg D — 1 + 1 — g. Suppose 
f(x) = f(x") = p € PY (C) for some x 4 x’. Without loss of generality we may assume 
that p = (1,0,...,0) (to achieve this we make a linear transformation of coordinates). It 
follows from (8.23) that f;/7j’° € L(D — x — 2'),i = 1,...,.N. This contradicts again 
Riemann-Roch. We have 


deg(Kx — (D — z — x')) = 2g — 2 — deg D +2 = 2g — deg D <0. 


Thus N < dim L(D — z — x') = deg D — 2 + 1 — g = dim L(D) — 2 = N — 1. This 
contradiction proves that our map is injective. To show that it is an isomorphism onto the 
image, we have to check that its derivative at each point does not vanish. It is easy to see 
that this is equivalent to the fact that L(D — x) # L(D — 2x) for any x € X. This is 
proved by the similar argument as before using the Riemann-Roch theorem. 














Corollary 8.6. Let R(X (T)) be the field generated by homogeneous fractions f /g, where 
f, g are modular forms of the same weight. Then 


R(XT)) = M(X(P)). 


Proof. It is easy to see that R(X (L)) is the field of rational functions on the image of the 
curve X (T) in P^ (C). Now we apply the Chow theorem that says that any meromorphic 
function on a projective algebraic variety is a rational function. 














Exercises 


8.1 Show that H* is not locally compact. 
8.2 Find all N for which the modular curve X(N) = X(T(N)) has genus 0 and 1. 
8.3 Find all N for which the modular curve Xo(V) = X (To(N)) has genus 0. 


8.4 Find all normal subgroups T of T (1) for which the genus of the modular curve X (T) 
is equal to 0. [Hint: Use Theorem 10.4 and prove that rə = up /2,7r3 = wr /3, Tool Lr). 
8.5 Generalize the Hurwitz formula to any non-constant holomorphic map f : X — Y of 
compact Riemann surfaces. 


8.6 Show that the Moebius transformation 7 — —1/Nr defines a holomorphic auto- 
morphism of finite order 2 of the modular curve Xo(V). Give an interpretation of this 
automorphism if one identifies the points of Xo(V) with isomorphism clases of pairs 
(E, H) as in Theorem 8.7. 


8.7 Let 
T,(N) = ic A) Tin sek. mod we 
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Give an analogue of Theorems 8.6 and 8.7 for the curve H/T; (V). 

8.8 Using Riemann-Roch theorem prove that any compact Riemann surface of genus 0 is 
isomorphic to P!(C). 

8.9 Using Riemann-Roch theorem prove that any compact Riemann surface of genus 1 is 
isomorphic to a complex torus C/A. 

8.10 Compute the dimension of the space M1(X0(11)). 


8.11 Using the fact that H/T(1) = C prove that any nonsingular plane curve of degree 3 
in P?(C) is isomorphic to a complex torus. 


8.12 Show that any modular curve of positive genus has at least two cusps. 


8.13 Find the genus of the curve X (7). Show that the cuspidal forms of weight 1 define 
an isomorphism from X (7) onto a plane curve of degree 4. 


8.14 Let N = 2,3,4,6,12 and k = 12/N. Show that the space of cuspidal forms 
My;,('(N))° is spanned by the function A(7) xN 
8.15 Consider the Hesse equation z? + y? + 2° + yxyz = 0 from Lecture 3. 


(i) Show that it defines an elliptic curve Æ(y) together with an isomorphism ¢ : 
(Z,/3)> > 3E. 


(ii) Show that the coefficient y considered as a function on H/T (3) is a modular func- 
tion generating the field M(X (3)). 


(iii) Show that the value of the absolute invariant function j(7) on the isomorphism 
class of E (y) is equal to 


(216 — +7°)°4? 


I) = (3 — 27)3 


[Hint: Find its Weierstrass equation by projecting the curve from the point (0, 1, —1).] 


8.16 Desribe explictly the action of SL(2, Z/3) on the field M(X (3)) (see Remark 8.2) 
as follows: 


(i) Show that —I € SL(2, Z/3) acts identically. 





(ii) Show that PSL(2, F3) = is generated by the elements T = (4 1) and S = (92). 











(viii) Show that PSL(2, F3) acts on the field M(X (3)) by transforming its generator y 
as follwos:T : a 3 e27/3a, T : y > eras 


Lecture 9 


Absolute Invariant and Cross-Ratio 


9.1 Let 
xı = (a1,b1), 22 =(a2,b2), 23 = (a3,63), 2&4 = (a4, ba) 


be four distinct points on P(C). The expression 

















ay bı||a3 b3 
b b 
pa a |a. bs 0.1) 
ay by| Jag be 
az b3||a4 b4 














is called the cross-ratio of the four points. As is easy to see it does not depend on the 
choice of projective coordinates of the points. Also it is unchanged under the projective 
linear transformation of P1 (C): 


(x,y) > (ax + by, cx + dy). 


If none of the points is equal to the infinity point oo = (0,1) we can write each z; as 
(1, zi) and rewrite R in the form 


(z2 — 21) (za — 23) 
(23 — 21) (24 — 22) 





R= (9.2) 


One can view the cross-ratio function as a function on the space 
X = (P'(C))*\A 


of ordered fourtuples of distinct points in P'(C). Here A denotes the “diagonal”, the set 
of 4-tuples with at least two coordinates equal. The group GL(2, C) acts naturally on X 
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by transforming each (x1, £2, £3, £4) in (g- £1, g - £2, g - £3, g - x4) and R is an invariant 
function with respect to this action. In other words, R descends to a function on the orbit 
space 

R: X/GL(2,C) > C. 


The following is a classical result from the theory of invariants: 
Theorem 9.1. The cross-ratio R defines a bijective map 
R: X/GL(2,C) > C \ {0,1}. 


Proof. Let (x1, £2, £3,£4) E X. Solving a system of three linear equations with 4 un- 
knowns a, b, c, d we find a transformation g : (x, y) > (ax + by, cx + dy) such that 


g: (a2, b2) = (1,0), g: (a3, b3) = (0, 1), 


g: (a4, b4) = (1,1), g- (a1, b1) F (1,A), 


for some A # 0,1. We recall that two proportional vectors define the same point. This 
allows us to choose a representative of each orbit in the form (A, 0,00, 1), where we now 
identify points in P!(C) \ {oo} with complex numbers. Since the cross-ratio does not 
depend on the representative of an orbit, we obtain from (9.1) 


R(x, T2, T3, x4) =À. 


Since A takes any value except 0 and 1, we obtain that the image of R is equal to C\{0, 1}. 
Also it is immediate to see that A and hence the orbit is uniquely determined by the value 
of R. 














Now let us take an orbit from X/GL(2, Z) represented by (A, 0, 00, 1) and assign to 
it the cubic curve given in affine coordinates by the Legendre equation : 


E(X) : y? — z(x — 1)(£ — à) = 0. (9.3) 


This equation can be easily transformed to a Weierstrass equation by a linear change 


of variables 2’ = x + HA y = 2y. In particular, we see that the functions (p(z) — 


Ha, p(z) /2) define an isomorphism from a torus E, = C/A, to E(A) for an appropriate 
T € H. We know that the zeroes of p(z)’ are the points in 5A and hence the points 
(x,y) = (0,0), (1,0), (A, 0) are the non-trivial 2-torsion points on Æ(A) (the trivial one 
goes to the infinity point (0, 1,0) € P?(C)). If we take the first two points as a basis in the 
group of 2-torsion points 2/(A) we obtain that E'(X) defines an isomorphism class of an 
elliptic curve together with a basis of its group of 2-torsion points. In other words, E (A) 


represents a point in the moduli space H/T (2). Conversely, given a point in H/T (2), we 
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can represent it by the isomorphism class of some E with a basis of 2F given by (5, 5) 
modulo A. The points 


T 1 


(E1, 22,23,24) = (PG + 5), 0(5):00,0(5)) 04) 


define an ordered 4-tuple of points in P(C), and hence an orbit from X. Replacing 7 


arte where e 3) € T(2), the point x; changes to (yT + 6)?xj,i = 2,3,4 
(see Example 6.5). This shows that the cross-ratio R(x, £2, 73, £4) does not depend on 


the choice of 7 representing a point in H/T (2). Together with Theorem 9.1, this proves 


with 7’ = 





Theorem 9.2. There is a natural bijection between the set of ordered 4-tuples of distinct 
points in P(C) modulo projective transformation and the points in H/T (2). 


9.2 In view of this theorem the cross-ratio R can be thought as a function 
R:H/T(2) > C. 


The next theorem shows that this function extends to a meromorphic function on X (2) = 


H*/T(2): 


Theorem 9.3. The cross-ratio function R extends to a meromorphic function A on X (2) 
which generates the field M(X (2)). It can be explicitly given by the formula 


A(T) = vo; (0; T)*/9o0(0;7)*. 


Proof. It follows from the previous discussion that, as a function on H, the cross-ratio is 
given by 





Hee, PE T,  @AF+5)— (5) 
R= R(p(= + =), P(=), ~, = f 9.5 
OGHA = Fae 0.5) 
We have 
dim M(T(2)) = 1 — 2k + ku2/2 = k +1. (9.6) 


In particular 

dim Mı (T(2)) = 2. 
We have seen in Lecture 6 that vĝg, vi äi 
ular forms of weight 1 with respect to the group I (2). There must be some linear relation 
between these functions. The explicit relation between the first set is known as Jacobi’s 
identity between theta constants: 


wea and (5), (4), (4) are examples of mod- 


vho = vi +91. (9.7) 
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The proof easily follows from the transformation formulas for the theta constants from 
Lecture 5. Write Vo = cvi mar c0 ı for some constants c1, c2. Replace 7 with —1/7 
2 2 


and use (5.8),(5.9) to obtain that cı = c2. Next replace 7 with 7 + 1 and use (5.3), (5.4) 
to see that cy = co = 1. 
The relation between the functions from the second set is the obvious one: 


T+1 T 1 
p( 


z ITA TPG) = 0. (9.8) 

It follows from the Weierstrass equation (the sum of zeroes of the cubic polynomial 4° — 
g2% — g3 is equal to zero). 

Now let us find the relations between functions from the first set and and the second 


one. We must have o(4) = ci 1 + cot P for some constants c1, C2. Applying the 





2 
2 2 
transformation T — 1 + 7 and using formulae (5.2)-(5.4) from Lecture 5, we see that 
cı = 2cg. Using the Fourier expansion of (53 T) given in Lecture 6, we obtain that 


C= oa Thus 





1 2/1 94 1 4 
Similarly we obtain 
Toz ao, l 4 14 
rT 1 Aoi ide. A 1 4 
SP = (2 — — — 0i). 11 
Adding up we check the relation (9.8). Subtracting we obtain Thomae’s Formulae: 
1 
T9 = (5) — (7/2), 
2 o4 Tog od 
n Vig 5 pls F 5) = (7/2), (9.12) 
2 2742 
1 Ts, d 
RY EE faces mY A 
mut, = 95) — (5 + 5). 
Now we can find an expression for the cross-ratio: 
oF + 5) — 0(5) 4 j.g4 
R= T =? 1/Vo0- (9.13) 
(5) — (3) °2 


It remains to show that the function \ = 04 i /95o generates the field of meromorphic 
2 


functions on X(T(2)). The algebra M (T(2)) contains the subalgebra C[W4, , Jf]. Us- 


op? 
ing (9.6) we can compare the dimensions of the subspaces of homogeneous elements of 
degree k to see that the algebras coincide. Thus 


M(T(2)) = C[941, 9o]. (9.14) 


1) 
05 
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By Corollary 8.6, the field M(X (T)) is isomorphic to the field of quotients of the algebra 
M(T). This implies that À generates the field M(X (T(2)). 














Definition. The modular function 
A= V1 /9bo 


with respect to T (2) is called the lambda-function. 


Let m : X(2) — X(1) be the natural holomorphic map defined by the inclusion 
Tr(2) c [(1). The pre-image of the absolute invariant 7*(j) is a meromorphic function 
on X(2) and hence must be a rational function in À. Let us find the explicit expression 
for this rational function. 

Theorem 9.4. 
(1— A+ A?)3 
A2(1— A)? © 


Proof. We know that (5), (5) and (5 + 3) are the three roots x1, 72, x3 of the equa- 


tion 4z? — gox — g3 = 0. Thus 


jae 





g2 = —4(21 x2 + 1123 + L273) = —2[(x1 + £2 +23)” 
(zi + 29 + 13)] = 2(a} + 13 + 23). 
Applying formulas (9.9)-(9.11), we obtain 
F Le 


a)4 
n= Hol) + oS) + 05 + 2) = © 


Using the Jacobi Theorem from Lecture 4 , we have 


g3 — 2792 = (27)? A = (27)? (27) "64 S= (2 92 91 V00: 





8 1 498 4 94 
(Fig + G1 Hoi) C15) 


NI= 


Using (9.7), we get 


12/,98 8 4 4 \3 


j ae, = femmen 
g3 — 2793 (24) 7951 93 V0 





28 (Vio i voa (Boo = 81)" K (1 = A+ A238 


B51 Vdo (Vdo = oi)? m X-A 
2 2 

















Note that there are exactly 6 = 3! values of À (counting with appropriate multiplic- 
ities) which give the same value of j. This corresponds to the orbit of [(1)/T'(2) = 
SL(2, F2) = S3 in its natural action on X(2). This shows that there are 6 values of the 
parameter in the equation (9.3) which define isomorphic elliptic curves. 
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Exercises 


9.1 Let p € P?(C) and l, la, l3, l4 be four distinct lines passing through p. For any line l 
in the plane not passing through p let p; = LO l;, i = 1, 2,3, 4. Show that the cross-ratio of 
the four points p1, p2, p3, pa does not depend on the choice of an isomorphism } S P!(C) 
and also does not depend on the choice of the line J. 


9.2 Find the expression for g3 in terms of the fourth powers of theta constants. 


9.3 
(i) 
(ii) 
(iii) 


(iv) 


(v) 


9.4 


© 


Gi 


9.5 


© 


(ii) 


Show that an unordered set of four points defines at most 6 different cross-ratia. 
Find the sets of unordered 4 points for which the cross-ratio takes less than 6 values. 


Show that the exceptional sets of points from (ii) correspond to harmonic or anhar- 
monic elliptic curves. 


Verify that the function j = j(A) from Theorem 9.4 takes the same value at all six 
cross-ratia. 


Show that there is a natural bijection between the sets of 4 distinct points in P! (C) 
modulo projective transformation and isomorphism classes of elliptic curves. 


Show that the permutation group S4 contains a normal subgroup H of order 4 
which acts identically on P1(C)*/GL(2, C) via its natural action on P'(C)* by 
permuting the factors. 


Show that S4/H = S3 = SL(2,F2) and the action of S4/H on the orbit space 
(P!(C)* \ A)/GL(2, C) corresponds to the action of SL(2, F2) on X (2) under the 
identification of (P'(C)* \ A)/GL(2, C) with X (2). 


Show that the affine curve y? = (1 — x?)(1 — Ax?) is birationally isomorphic to 
the curve y? = x(x — 1)(x + Ax). Show that the exists an elliptic function sn(z) 
(called the Jacobi sine function) such that (sn(z)’)? = (1 — sn(z))?(1 + Asn(z)?). 


Define the Jacobi cosine function cn(z) by cn(z) = sn(z)’. 


formula 


Prove the addition 


_ sn(z)en(w) + sn(w)en(v) 
1 +Asn(z)2sn(w)? 





Lecture 10 


The Modular Equation 


10.1 In this lecture we shall prove that the modular curve Xo(V) can be defined by 
homogeneous algebraic equations with coefficients in Z. By reducing the coefficients 
modulo a prime p we obtain a nonsingular projective algebraic curve over a finite field Fp 
for all prime p except finitely many. 

We shall start with the following 


Lemma 10.1. Let T and I’ be subgroups of finite index in T (1). Assume that there exists 


a matrix A = 6 4 € SL(2,R) such that T' C A~!.T- A. Then, for any f € Mz (T), 


ar+ £6 


eae bid). 


flA= f( 





Proof. We have checked it in Chapter 6 for the case A € SL(2, Z). But this assumption 
has not been used in the proof. 


Corollary 10.1. For f(T) € M(T(1))ẹ we have 
f(N7r) € M(To(N))k. 














In particular, 
f(N7)/f(r) € M(Xo(N)). 
Proof. Take 
af 0. SN 
F = (Jx 0 ) ; (10.1) 


We have, for any M € T(1), 


parisi A) 6) te )- 
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Clearly, this implies that To(N) c F-T(1)- F7}. Now 


fleF = f(-1/Nr)(N2@r)-%* = f(NT)(NT)* (NT) ™ = NF f(N7). 





This checks the assertion. 











Example 10.1. Take N = 2 and f = A(T) € M(I\(1))6. We see that A(27)/A(T) 
belongs to the space M(Xo(2)). Observe that q = e?’7 changes to q? when we replace 
T with 27. So 





2 J% 2m)24 0° 
A Ria pee) _ my24 _ 9-126, (7)24 
2r)/AC) = “Te a gry A y= 2hr), (10.2) 


where f2(7) is the Weber function defined in (4.13). In particular, we see that 

fo = 2° A(27)/A(7) (10.3) 
is a modular function with respect to ['9(2). It follows from (10.2) that f24 has a simple 
zero at the cusp oo. The index of this cusp is equal to 1 since (4 t) € T'9(2). We know 


from Lemma 8.5 that juo,2 = [['(1) : To(2)] = 3. Thus To(2) has another cusp of index 2. 
Since 0 ¢ To(2) - 00 we can represent it by 0. We have 


2 (1/7) = 2° A(-2/7)/A(-1/7) = 2° A(-1/(r/2))/A(-1/7) = 
2" (r/2)A(T/2)/T A(T) = A(r/2)/A(r) = 


q? Heada =q r] (1 + g2 
aTi — g) 


m=1 


Nie 





This shows that f2 has a simple pole at the second cusp. Since f3* is obviously holo- 
morphic on H we conclude that it has a single pole of order 1. This implies that the 
meromorphic function f3* : Xo(2) —> P!(C) has degree 1 and hence maps Xo(2) isomor- 
phically onto P!(C). In particular, {3+ being the inverse transform of the rational function 
z on P!(C) generates the field of rational function on Xo(2): 


M(Xo(2)) = C(A(2r)/A(r)) = C(f3"). (10.4) 
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10.2 It follows from the Corollary 10.1 that j(V7) belongs to the field M(Xo(N)). 
This field contains the field M(X(1)) = C(j(r)) as a subfield and the degree of the 
extension is equal to fo,v. We shall prove that j(N7T) generates the extension, i.e. 
M(Xo(N)) = C(j(7),7(N7T)). We will also describe the algebraic relation between 
g(r) and j(N7). 


Lemma 10.2. For any natural N, 


ra) (4 1) TQ) = L] ra, 


AEAN 


where Ay is the set of integral matrices (4 b) withd > 0,ad = N,0 < b < d, (a,b, d) = 
1. The number of elements in Ay is equal to uọ,N. 


Proof. First of all the right-hand side is the set M(N) of integral primitive (i.e. with 
g.c.d of entries equal to 1) matrices with determinant N. In fact, for any such matrix we 
can apply row transformations with matrices from I'(1) to reduce it to upper triangular 
form. By further row operations we can make d positive and b satisfy 0 < b < d. The 
number a will be the greatest common divisor of the first column of the matrix, so is 
defined uniquely. Then d will be defined uniquely by the condition ad = N and b will be 
defined uniquely by the above condition. It is obvious that the left-hand side is contained 
in M(N). To prove the opposite inclusion, it suffices to show that each matrix A from 
Aw is contained in the left-hand-side. This follows from the well known fact that each 
integral matrix can be transformed by integral row and column transformations to the 
unique matrix of the form ( 0 2, ), where n|n’. The last assertion can be checked by using 
elementary number theory. When N = p is prime, we obviously have #A, = p + 1. 
Now, if N is not prime we have 





#An = ¥(N) = N | [0 +p) = pow. 


p|N 





This can be proved by using the multiplicative property of the function y(n) and the 
formula 





v(N) = pola, 2), 











where ¢ is the Euler function. 





Lemma 10.3. Let f(T) be a modular function with respect to T (1) which is holomorphic 
on H and admits the Fourier expansion f = )<?°__,. nq". Then f is a polynomial in j (T) 
with coefficients in the subring of C generated by the Fourier coefficients co, ... , C_r- 
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Proof. Observe first that r > O unless f is constant. Since the Fourier expansion of j 
starts as q7} +... we can subtract c_,j” from f to decrease the order of its pole at oo. 
Then we do it again, if needed, untill we get that the difference g has Fourier expansion 
of the form q” +... with m > 0. Since g is holomorphic at infinity and vanishes there, 
it must be zero. Since all the coefficients of the Fourier expansion of j are integers, as a 
result we subtract from f a polynomial in j with coefficients in Z[c_,,..., Co] and obtain 
0. 














Lemma 10.4. Let f : X — Y be a holomorphic map of compact Riemann surfaces. 
Then f* : M(Y) > M(X) defines an algebraic extension of the field of meromorphic 
functions. The degree of this extension is equal to the number of points in the pre-image 
f1 (y) (counting with multiplicities equal to the ramification indices) for any y € Y. 


Proof. We skip the proof of this lemma. One can learn about this fact in any intruduction 
book in algebraic geometry. 














Theorem 10.1. The field M(Xo(N)) is generated by j(T) and j(Nr). There exists a poly- 
nomial ® vy [X,Y] € ZX, Y] such that F(j(Nr), j(7)) = 0. The polynomial ® y[X, j| € 
C(j)[X] is a minimal polynomial for j(NT) in the fields extension M(Xo(N))/M(X(1)). 
Its degree is poy. When N > 1, ®n[X,Y] is symmetric in X and Y, and if N = pis 
prime, 

Oy (XV Ss XP+ 4 yh a XPYP — XY modp. 





Proof. Let Ay be the set of matrices from Lemma 10.2. Consider the polynomial 


YN) 
d= || (X-T) = So mX” 
AEAN m=0 


Its coefficients sm are symmetric functions in j(A - T) and hence are holomorphic func- 
tions on H. It folows from Lemma 10.2 that, for each M € T(1) and A € Ay, we have 
AM = M'A for some M' € T(1), A’ € Ay. Thus 7(A- (M-7)) = 9(M'- (A’-7)) = 
j(A’- T). Thus replacing 7 by M - 7 defines a permutation among the functions j(A - 7). 
This implies that sm are modular functions with respect to T (1). By Lemma 10.3, each 
Sm is a polynomial in 7(7) with coefficients belonging to the subring of C generated by 
its Fourier coefficients. However, for any A = ( o 2) € An, we have 


2ri(at+b) 2riat 2nib 








e d =e @deds= qa, 
where q = e?7’7, as usual, and Cy is the primitive d-th root of unity equal to et/d_ Now, 
using the Fourier expansion of j(7) we obtain 

aT +b 1 , ab 

il d ) = qalacé j b(q4¢y), (10.5) 
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where ¢ is holomorphic at infinity. Since the coefficients of j are integers we see that the 
coefficients of the Fourier expansion of each j(A-7) belong to the ring Z[¢q]. By Lemma 
10.3, the coefficients Sm are polynomials in j(7) with coefficients in Z|¢n]. Consider 
the automorphism of the cyclotomic field Q(¢y) which acts by sending Cy to ¢%,, where 
(k, N) = 1. It is clear from (10.5) that this automorphism transforms j(A-7) to j(A’-7) 
for some other A’ € Ay. This shows that the functions sm are invariant with respect to 
all such automorphisms, hence must be polynomials in 7 with coefficients in Z. 
Thus we can consider ® as an element of the ring Z[X, j]. Replacing the variable 
j with Y we obtain the polynomial ®y(X,Y) € Z[X,Y]. This will be the polynomial 
N 0 
0 1 
®v(j(N7T)),7) = 0. The polynomial ®y[X, j] is of degree y(N) and is irreducible 
since its roots j(A - T) are permuted transitively by the group T (1). By Lemma 10.4, its 
degree is equal to the degree of the extension M(Xo(N))/M(X(1)). Since ®y[X, j] 
is the minimal polynomial for j(V7) over the field C(j) = M(X(1)), and its degree is 
equal to the degree of the extension, we see that j(7) and j(NT) generate M(Xo(N)). 
Next, replacing 7 with —1/N7 in the identity ®y(j(N7T), 7) = 0, we obtain 


from the assertion of the theorem. First of all, taking A = € Ay we obtain 


®n(5(—1/7), 5(-1/N7)) = On 9, 5(N7)) = 0. 


Since ®y(X, j) is irreducible as a polynomial in X, the polynomial ®j(j, X) must be 
divisible by ®y(X, 7). It follows from the Gauss lemma that 

Oy (X,Y) = cOn(Y, X), where c = +1. If c = —1, we have ®y(X, X) = 0, hence 
®v(j,j) = 0. However, ®y(X, 7) is irreducible over C(j) hence j cannot be its zero. 
So c = 1 and we obtain that ®y (X,Y) is symmetric in X,Y. It remains to prove the last 
property (Kronecker’s congruence relation). 





1 
Assume N = pis prime. Then the set A, consists of matrices A, = ( 0 5) , 0< 


s < p, and Ap = 6 a It follows from the formula (10.5) and the Fermat theorem 


that we have the following congruence for the Fourier expansion of j (As - T) in q? 
j(As - T)(4) = j(q)"? mod (1- &), 


j(Ap + T)(q) = j(q)? mod p. 


Here the congruence means that the corresponding Fourier coefficients satisfy the con- 
gruence. The principal ideal (1 — Çp) in the ring Z[¢p] is prime and (1 — ¢p)|p (since 


(E iG) — Gp) = —p). This implies 


p(X, j(q)) = (X — IPX? — j(q)) mod (1 — G). 
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Let p(X, j) — (X — P(X? — j) = $n amX"™. The previous congruence shows that 
the coefficients am are all divisible by (1 — Çp), and since they are integers, they must be 
divisible by p. This proves the theorem. 














Definition. The equation ®y(X, Y) = 0 from the previous theorem is called the modular 
equation . 


Example 10.2. Let p = 2. The modular equation in this case is 
F(X,Y) =(X -Y°\(X? —- eo eB BI (EY Hog x + Y?)+ 
28.7 Gl -373XY + 283" - 56(X + Y) — 9305 = 0. 


For N = 3 the modular equation was computed by Stephen Smith in 1878 9 (few 
coefficients turned out to be wrong and corrected by Hermann, Crelle J. 274 (1973). It 
has the form 


F(z, y) = z(x +2” «3 -5°)3 + y(y +. 27-3 -53)3 — 23 y3+ 
23 . 3? . 31a7y?(a + y) — 2? - 33 - 9907xy(a? + y?) + 2-34-13 - 193 - 6367x7424 
216 . 35 . 53 . 17 - 263ay(x + y) — 23! - 56 . 22973xy = 0. 


Other cases where it was computed explicitly are N = 5,7,11. The last case took 
20 hours on a VAX-780. Itis a polynomial of degree 21 with some coefficients of order 
10°. 


Corollary 10.2. The modular curve Xo(N) is isomorphic to a nonsingular projective 
algebraic curve defined over Q. 


Proof. We assume that the reader is familiar with some basic notions in algebraic ge- 
ometry (first two chapters of [Shafarevich] suffices). The theorem says that Xo(N) is 
birationally isomorphic to the plane affine curve ®y (x,y) = 0 defined over Q (i.e. its 
equation is given by a polynomial with rational coeffcients). By homogenizing the equa- 
tion we obtain a projective curve defined over Q. Now we use the normalization process. 
Since this process can be done over the same ground field, the normalized nonsingular 
curve is also defined over Q. 














Remark 10.1. In fact, one can choose the equations defining Xo( N) with coefficients in Z. 
This allows one to reduce the coefficients modulo a prime number p to obtain a projective 
algebraic curve over a finite field Fp. It follows from the Kronecker congruence that the 
prime numbers p dividing N are “bad primes”, i.e. the reduction is a singular algebraic 
curve. One can show that all others primes are “good primes’, i.e. the reduction is a 
nonsingular algebraic curve. The reductions of the modular curve Xo( N) modulo a good 
prime p are examples of curves over a finite field with “many rational points” and are used 
in coding theory. 
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Definition. A holomorphic map between elliptic curves E’ — E is called an isogeny of 
order n if it is a homomorphism of groups whose kernel is a group of order n. 


Let E = C/A, E’ = C/A’. It follows from the definition that any isogeny f : E’ > 
E can be lifted to a map f : C > C,z > az such that f(A’) C A. The kernel of this 
map is the group a 'A/A’ C C/A’. So its order is equal to the determinant of the matrix 
A = (2°) such that 


aw = aw, + bwh, aw, = cw, + dwh. 


Here A = Zw, + Zw, A’ = Zwi + Zwi. We can change the bases to assume that A = 
(a a ) is diagonal with dı|d2 and djdz = n. The pair (dj, dz) is defined uniquely by 
the previous property and is called the type of the isogeny. The isogeny is called cyclic if 
dı = 1. In this case the kernel of the isogeny is a cyclic group of order n. 


Corollary 10.3. Let E, be a complex torus corresponding to the lattice Z + TZ. Then the 
set of isomorphism classes of elliptic curves admitting a cyclic isogeny f : E' > E of 
order N consists of the isomorphism classes of elliptic curves E, where 


®y(5(7'),5(7)) = 0. 


Proof. Let E' + E be a cyclic isogeny of order N. As we have explained before, 
replacing the curves by isomorphic curves, we may assume that 


E =C/Zu + Zwo, E' = C/Zw + ZNwo. 


Further replacing them by isomorphic curves we may assume that w1 = l,wg = T E€ H. 
Thus the isomorphism class of E is determined by the value of j at 7, and isomorphism 
class of F’ is determined by the value of j at Nr. But the pair (j(N7T),7(7)) satisfies 
the modular equation ®y (x,y) = 0. Conversely, if (j(7’),7(7)) satisfies the modular 
equation, then j(7’) = j(A- 7) for some matrix A = (4°) € Ay. This implies that 
E! = E4.,. Since T and A-7 are both in the upper half-plane, we must have 7’ = A-T = 
(ar + b)/d. Replacing Z + Zr’ with dZ + (at + b)Z which defines an isomorphic curve, 
we see that dZ + (ar +b)Z C Z+ Zr and hence there exists an isogeny E! + E, whose 
kernel is given by the matrix A. Since (a,b, d) = 1, the elementary divisors of this matrix 
are (1, ad). This shows that f is a cyclic isogeny. 














Corollary 10.4. Let tT E€ Q(/—d) where d is a positive rational number. Then the value 
j(T) is an algebraic integer. 


Proof. Let O be the ring of integers in the quadratic field Q(./—d). It admits a basis 1, w. 
Let a € O such that its norm N is square-free. Then 


aw =aw+b, a=cwd. 
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Here the matrix M = (2 b) has determinant equal to the norm of œ. Since the latter is 
equal to the determinant of the matrix and is square-free, we have (a,b,c,d) = 1. Also 
observe that w = A-w = wth, By Lemma 10.2, M = M'A, where M’ € T(1), and 
A € Ay. This shows that 





j(w) = j(M w) = j(M'A - w) = j(A-w). 


and hence j(w) satisfies the equation ®y(X,X) = 0. This equation is a monic polyno- 
mial over Z, so that j(w) is an algebraic integer. We can write nr = aw + £ for some 
integers n, a, 8. Since ®,(j(T), j(nT)) = 0, 7(7) is integral over the ring Z[j(n7]. So, it 
suffices to show that j (nT) is an algebraic integer. Since j(n7) = j(aw + 8) = j(aw) = 
j(—aw), we obtain, by the previous argument, that j (nT) is integral over j(w). Since the 
latter is an algebraic integer, j(7) is an algebraic integer as well. 














Remark 10.2. Notice that r € Q(./—d) if and only if the lattice A, has complex multi- 
plication (see Lecture 2). By Exercise 2.6 this is equivalent to that £, has endomorphism 
ring larger than Z. An elliptic curve with this property is called an elliptic curve with 
complex multiplication . Viewing j as a function on the set of isomorphism classes of 
elliptic curves, the previous corrollary says that the value of 7 at the isomorphism class of 
an elliptic curve with complex multiplication is an algebraic integer. 


Remark 10.3. The classical Kronecker Theorem asserts that any finite abelian extension 
of Q with abelian Galois group can be obtained by joining roots of unity to Q. Observe 
that a nth root of unity is the value of the function f(z) = e?7'*/" on Z. Let K be an 
imaginary quadratic extension of Q and let a be an ideal in the ring of integers of K. 
Then the set j (a) generates a maximal non-ramified extension of the field K with abelian 
Galois group. This is the celebrated *Jiigendtraum” of Leopold Kronecker which was 
proven by himself when he had passed his youth age. 


Corollary 10.5. A modular function f € C(j,jn) belongs to Q(j, jn) if and only if its 
Fourier expansion at œ has all coefficients in Q. 


Proof. Since j and jy has rational Fourier coefficients, we only need to prove the suf- 
ficiency. Let f = R(j,jn) where R = P(x,y)/Q(x,y) is a rational function with 
coefficients in C. Any automorphism o of C acting on C(j, jn) sends R to R7 by re- 
placing the coefficients of R with its c-conjugates. This is independent of the choice of 
R since the modular equation relating j and jy has coefficients in Q. Let f7 denotes the 
image of f under the action of ø. I claim that 
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where f(T) = X7 __, cnq” is the Fourier expansion of f at oo. Since 
p(N)-1 l 
CO in)= X CO), 
i=0 
it suffices to prove the assertion for f € C(j). Write 


a ao + aij +... + anj” 
bo + bij +... + bmj™ 





(10.6) 


Replacing f with f~! we may assume n > m. Multiplying by some integer power of 
j, we may assume that ao, bo # 0. Since ao/bo is equal to the value of f at oo, it must 
be a rational number. The difference (f — $°)/j has Fourier coefficients in Q, and has 
representation in the form (10.16) with smaller n. Continuing in this way we arrive at the 


case n = m = 0 where the assertion is obvious. 














10.3 Let us explain the meaning of the symmetry property of the modular equation. 
Consider the map H — H defined by the formula 7 + —1/NT-. It is easy to see that the 


matrix F = ( g as belongs to the normalizer of the group To(N) in SL(2, R), ie. 


FMF! € To(N) for any M € To(N). This implies that the previous map factors to a 
map of the quotient H/To(N) —> H/To(N). It can be shown using some basic algebraic 


geometry that it extends uniquely to a holomorphic map 
Fr: Xo(N) SF, X(N). 


Observe also that F? = —1 so that Fr? = identity. It is called the Fricke involution. By 
taking the inverse transform of functions, the Fricke involution acts on modular functions 
of weight k by 

Fr*(f)(7) = f(-1/N7) = (Nr)**f(N7). 


In particular, 
Fr°(j(7)) = 5(N7),  FY’G(N7)) = i(-1/7) = 57). 


This implies that the Fricke involution acts on the modular equation by switching X and 
Y. 


Remark 10.4. Let Xo(N)™ = Xo(N)/(Fr) be the quotient of the curve Xo(N) by the 
cyclic group generated by the Fricke involution. One can find all numbers N such that the 
genus of this curve is equal to 0. It was observed by A. Ogg that the list of corresponding 
primes is the same as the list of all prime divisors of the order of the Monster group, the 
largest simple sporadic finite group. This has been explained now. 
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Example 10.3. We know that R = A(2r)/A(r) generates the field M(X0(2)). The 
Fricke involution acts on this generator as follows: 


A(-1/7) ADP OAO yp 


M09 = ACI) > Be@n@n® ~ FART 





We know that every modular function with respect to the field [9(V) can be written 
as a rational function in j and jy with complex coefficients. In other words, it belongs 
to the field C(j, jy). The next theorem characterizes functions which belong to the field 


Exercises 


10.1 Prove that there exists exactly (N) isomorphism classes of elliptic curves admitting 
acyclic isogeny of order N onto a fixed elliptic curve. 


10.2 Let f : E’ — E be an isogeny between elliptic curves of order N. Show that there 
exists an isogeny f’: E — E’ of the same order. 


10.3 Show that the Fricke involution of H/To(N) sends the point representing the iso- 
morphism class of the pair (E, A) (E is an elliptic curve and A is its cyclic subgoup of 
orer N) to the pair (E’, A’), where E’ = E/A, A! = yE/A. 

10.4 Let f,g be two modular forms of the same weight with respct to [(1). Show 
that, for any A € Ay the function f(A - T)/g(T) is a modular function with respect 
to (A-IT(1)M) NT (1). 

10.5 Show that @(7(7),7(7)) = 0 for some N > 1 if and only if E, has complex 
multiplication. 

10.6 Let N = 2,3,5,11 and k = 12/(N +1). Show that the space of cuspidal forms 
Mg(To(N))? is spanned by the function (A(r)A(N7)) WaT, 

10.7 Let N = 2,3,6 and k = 6/N. Show that the space of parabolic forms M;,(I'(N))° 
is spanned by the function A(r)!/%. 

10.8 Show that M(X0(2)) = vaar a [Hint: use that f?4 = v/m? and apply the 
six crss-ratia formulas]. : f 

10.9 Generalize Example 10.1 by proving that the function ®(7) = (Ah) va gener- 
ates the field M(Xo(N)) for N = 2,3, 5,7, 13 [Hint: Check that N-t has one zero and 
one pole of multiplicity N — 1 and use the formula for the genus of Xo(N) to check that 
Xo(N) = P! (C)] 

10.10 A modular function f € M(X (T)) is called a Hauptfunction for T if it generates 
the field M(X (T)) and admits a Fourier expansion at the cusp oo (of index h) of the form 
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q1 + ms amq™/™, where am are integers. An example of a Hauptfunction is the 
absolute invarinat j. 
(i) Show that the functions (AX?) N5 are Hauptfunctions for the group [9(V) when 
N = 2,3,5, 7. 


(ii) Show that the function y?, where y is the parameter in the Hesse equation (see 
Problem 3.6) is a Hauptfunction for I'9(3). 


(iii) Show that the 274A is a Hauptfunction for T (2) (see Lecture 10). 


900(037)? +991 (O37)? 


(iv) Show that the function 4 Ty Ory? 





is a Hauptfunction for T (4). 


10.11 Show that the fundamental domain for '9(p) where p is prime, can be obtained as 
the union of the fundamental domain for I\(1) and its translates by transformations ST*, 
where k = 0,...,p. 


10.12 Find the expression of the absolute invariant j in terms of the generator ® of the 
field of modular functions for To(2). 


10.13 Prove that the cosets of (1) modulo I'o(V) can be represented by the matrices 


& | where (c, d) = 1, d|NmO < c < N/d. 
10.14 Prove the doubling identities: 
2919(27)? = Yoo(T)? =, Don (T)?, 


20o0(27)? = voo(T)? + vo (7). 
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Lecture 11 


Hecke Operators 


11.1 Let S and S’ be two sets. A correspondence between S and S” is a subset Z C 
S x S". For example, Z could be the graph T of a map f : S — S’. One can view Z 
as a multi-valued map from S to S’ as follows. Take s € S, and consider the intersection 
{s} x S'N Z. Then take the image Z(s) of this set under the second projection prg : 
S x S’ — S$". This is called the image of s under Z. We will assume that Z is a 
finite correspondence meaning that each set Z(s) is finite (maybe empty). Clearly, Z 
is completely determined by its images. When Z = I’, is the graph correspondence 
we obtain the usual value of the map on s. The analog of compositions of maps for 
correspondences is the following operation. Let Z’ C S’ x S” be another correspondence. 
Set 


Z'o Z =pri3((Z x S") x (S x Z"), 


where p13 is the projection map S' x S’ x S” > S x 8”. Itis called the composition of 
the correspondences Z and Z”. It is easy to see that the value of Z’ o Z at s € S is equal 
to the union Uyez/s)Z’(s’). In particular, when Z’ is a function f : S — S” (identified 
with its graph), we have f o Z(s) = f(Z(s)). 

One can view any finite correspondence as a map f : S —> P(S") fin, where P(S") fin 
is the finite Boolean of the set S’, i.e. the set of finite subsets of S”. Using the characteristic 
function of a set we can identify P(S”) pin with the set of functions with finite support 
which take values 0 or 1. Now let K be any commutative ring. For any set X denote by 
K~ the ring of functions X — K with finite support. Its basis consists of characteristic 
functions y;,; and can be identified with elements of X. This allows us to write its 
elements as finite linear combinations of elements of X with coefficients in K. We have 
encountered this notion when we defined divisors on Riemann surfaces. By including 0, 1 
in K we can identify any correspondence Z C S x S” witha function Z : S > K S" We 
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have 
Z(s) = ` 1- s. (11.1) 
s'EZ(s) 
Now we extend the notion of a correspondence by making the following: 


Definition. Let K be a commutative ring and let S, S’ be two sets. A finite K-correspondence 
on the set S x S’ is a function Z : S > KS. 


We have a natural function 


deg: KS > K, 65 X (s) (11.2) 
ses 
which is an analog of the degree of a divisor. If Z is a correspondence as in (11.1), then 
deg(Z(s)) = #Z (s), where Z is considered as a multivalued map. 


Since K’ is an abelian group with respect to the operation of addition of functions, 
we see that the set of finite K-correspondences on Ș x S” forms an abelian group. In par- 
ticular, take S = S’ and denote the set of finite correspondences on S x S by Corr(S) x. 
It has two operations: an addition and the composition. The latter generalizes the opera- 
tion of composition of correspondences from above. For any f : S —> K* denote by f its 
extension to a map KS — K® defined uniquely by additivity: 


fo; ass) = y as f(s). 
ses ses 


For any f,g € Corr(S) x we set 
f o g(s) = f(g(s)). (11.3) 


We leave to the reader to verify that this defines a structure of an associative ring on 
Corr(S) x. It is called ring of finite K-correspondences on the set S with values in K. In 
fact, it is obviously an algebra over K (since K’ is a K-algebra). When K = Z we skip 
the subscript in the notation. 

Let Z be a finite K’-correspondence on S x S’. Any function ¢ : S’ — R with values 
in a K-algebra R can be extended by additivity to a function d:K* oR using the 


formula 7 
o( `> ass’) = > asr(s’). 
SES SES 
This allows us to define the inverse transform of ọ under the correspondence Z: 


Z*(b) = 402. 
If Z(s) = Vo yes @s’s’, then 


Z*(4)(s) = X ayd(s’). (11.4) 


SES 
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Example 11.1. Let f : X — Y be a holomorphic map of compact Riemann surfaces. 
Define a function r : X — Z by r(x) = ramification index of f at x. Recall that this 
means that, taking a local parameter t at f(a), the function t o f has a zero at x of order 
r(x). Consider fT} as a correspondence on Y x X given by the inverse f7}. More 
precisely, f7! = {(y,r) € Y x X : f(x) = y}. Then 


does not depend on y and is equal to the degree of the map f. 


11.2 We will be interested in the following situation. Let S' be the set £ of lattices in C. 
Define a correspondence on £ as follows 


T(n)={(A, A) ELXL:A'CA,[A: A] =n}. (11.5) 
We take the natural inclusion of T(n) in the product £ x £. 


Lemma 11.1. Let A’, be the set of integral matrices E t) with ad = nand0 < b < d. 
Fix a basis (w1,w2) of a lattice A. For any A € Al, denote by A(A) the sublattice 
Z(awz + bwi) + Zdwə. Then the map A > A(A) is a bijection from the set Al, onto the 
set T(n)(A). 


Proof. Note that the set A’, differs from the set A, used in the previous lecture only 
by abandoning the primitivity property of the matrix. As in the proof of Lemma 2 in 
this lecture, we show that any integral matrix with determinant n can be transformed 
to a unique A € A’, by integral row transformations. This shows that any sublattice 
A’ € T(n)(A) has a unique basis of the form w| = dw, w = bw, + awe, and hence is 
equal to a unique A(A) with A € A’. 














Corollary 11.1. 
deg T(n)(A) =X a. (11.6) 


d\n 


For any nonzero complex number c consider the correspondence Re on £ defined by 
the function A > cA. 


Lemma 11.2. The correspondences T(n) and Re form a subring of the ring Corr(L). 
They satisfy the following relations: 


(i) Tim)oT(n) =T(mn) if (m,n) = 1; 


(ii) T(p") o T(p) = T(p"*1) + pT(p"™t) o Rp, where p is prime; 
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(iii) T(n) o Ra = Rao T(n); 
(iv) Rao Ry = Rap. 
Proof. The last two properties are obvious. To prove (i) we observe that 
T(n)o T(m) ={(A, A”) € Lx L:[A: A] =n, [A’: A”] =m _ for some A’}. 


If (m,n) = 1, the finite abelian group A/A” contains a unique subgroup of order m. Its 
pre-image in A must be A’. This shows that 


T(n)oT(m) ={(A, A”) € Lx L: JA: A”) = mn} =T(mn). 


This proves (i). We have 


[A:A/J=pn+t 
where 
an = HA" : [A : A"] = p, [AY : A] = p”}. 
Now 
Toa = J A 
[A:A/J=pr+2 
pT(p”"')o Rp(A) = pT(p”"')(pA)=p A bw, 
[A:A)]=pr+1 
where 


(11.7) 


1 if A’ Cpa. 
bse i 
0 if A’ ¢ pA. 


Comparing the coefficients at A’ we have to show that 
(a) aw = lif A’ ¢ pA; 
(b) an =pt Lif A’ C pA. 


Recall that a, counts the number of A” of index p in A which contain A’ as a sublat- 
tice of index p”. We have pA C A” C A. Thus the image A’ of A” in A/pA is a subgroup 
of index p. In case (a) the image of A’ in the same group is a non-trivial group contained 
in A’. Since the order of A is equal to p, they must coincide. This shows that A” in A/pA 
is defined uniquely, hence there is only one such A”, i.e. ay = 1. 

In case (b), A” could be any subgroup of order p in A/pA. The number of subgroups 
of order p in (Z/pZ)? is obviously equal to p + 1. 
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Corollary 11.2. The correspondences T(n) are polynomials in T (p)’s and 
Ry’s, where p runs through the set of prime numbers. In particular, T(n)’s and Ry,’s 
generate a commutative subring H of Corr. 


Definition. The subring H of Corr generated by the correspondences T(n) and Rp is 
called the Hecke ring of T (1). 


11.3 Consider a function f on £; using definition (11.4) , we have 


T(n)*(f)(A)= So f(A). (11.8) 


[A:A‘J=n 


We apply it to the case when f is defined by a modular form of weight 2k with respect to 
T. Choose a basis (w1, w2) of A with T = w2/w1 € H. Then set 


F(A) = (w1) ™ F(T). (11.9) 
This definition is independent of the choice of the basis as above. In fact, if wh = QW + 


Bur, wi = ywe + dw with some M = a € SL(2, Z), we have 


wr ATEI FM r) =u (7). 
This function satisfies the property 
flad) =a” f(A). (11.10) 


Conversely given a function f on £ satisfying this property we can set f(r) = f(Z+Zr). 
Then 





r+ mti) = (yr +8)” f((yr + 6)Z + (ar + B)Z) = 


(yr + 6) F(Z +72) = (yr +8)” f(r). 


By property (iii) of Lemma 1, we obtain that T(n) leave the set of functions fonH 
satisfying (11.6) invariant. 





Let Fy be the space of functions on £ of the form f where f € M(I'(1))r. 
Theorem 11.1. For any positive integer n and any non-negatve integer k, 


Try) CF. 
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Proof, Let f € M(I(1)), and f € Fy. We know that 
(iene: So fn eet NA): 
[A:A/J=n [A:A/J=n 


This shows that T(n) f = g, where g is a function on H satisfying g( e 3 3) T= (97+ 


6)~?* g(r) for any (° E) er . We have to check that g is a holomorphic function 
on H and at infinity. Applying Lemma 2, we have 


g(t) =T(n)f(Z+7Z) = D f((ar +b)Z + dZ) = 5o a ER oe 


AEA, AEA, 
Thus x4 
= aT 
= AIE IER A (11.11) 
AEA! AEA! 


Clearly, g is holomorphic on H as soon as f is holomorphic. It remains to find its behavior 
at infinity. Let 
OO 
f E a mer 
m=0 


be the Fourier expansion of f at oo. Then 


CO 
g= y gk Ya e2tim(ar +b) 19), 

AeA}, m=0 

Observe that 
, d ifdinm, 
5 e2Timb/d = | (11.12) 
o<bed 0 otherwise. 
This gives 
5 drt DD eigen a”) = 3 arte 3 Cmd” 
ad=n,a>1 m' EZ ad=n,a>1 m' EZ 


Now let m = am’ we have d = n/a, so we can rewrite it as follows: 
g= `> q™( 5 (n/a) ™ emna) = ys bigs (11.13) 
meZ a|(n,m),a>1 mEZ 


Since cg = 0 for k < 0 we get bm = 0 for m < 0, so that g is holomorphic at oo. Also we 
see that, if co = 0, then bọ = 0, i.e. T(n) maps a parabolic form to a parabolic form. 














135 


From now on we shall identify M (T (1))ẹ with Fẹ. So we have linear operators T(n) 
in each space M (T (1))ẹ which also leave the subspace M (I'(1))? invariant. 

To avoid denominators in the formulas one redefines the action of operators T(n) on 
the vector space M(T(1))ẹ by setting 


T(n)f = n™ T(n) (f) =n $) flA (11.14) 
AEA, 


These operators are called the Hecke operators. Let 


OO OO 
TOS ed) = Y ba (11.15) 
m=0 m=0 
It follows from (11.9) that for prime n = p, we have 


if l 
bmm P Fi im (11.16) 
Cmp +P" "Cm /p if plm. 


Also, for any n, 
bo = G2~-1(N)co, bı = Cn. (11.17) 


11.4 We will be interested in common eigenfunctions of operators T(n), that is, func- 
tions f E€ M;(I(1) satisfying 


T(n)f =A(n)f for all n. 


Lemma 11.3. Suppose f is a non-zero modular form of weight 2k with respect to T (1) 
which is a simultaneous eigenfunction for all the Hecke operators and let X` cnq” be its 
Fourier expansion. Then c # 0 and 


T(n)f = =f. 


C1 


Moreover, if co 4 0 we have 
Cn/C1 = oak- (N). 


Conversely, if co 4 0 and the coefficients Cy satisfy the previous equality, then f is a 
simultaneous eigenfunction of Hecke operators. 


Proof. In the notation of (11.11) we have 
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If c, = 0, then bı = A(n)cı = 0. But, by (11.12) we have cn = bı. This shows that 
Cn = 0 for all n Æ 0. Thus f is constant, contradicting the assumption. So, c1 Æ 0, and 
Cn = b1 = A(n )cı implies 

Aln) = cn/ c1. 


If co # 0, we use (11.12) to get bo = C2k—-1(N)co = A(n )co. This gives 


A(n) = oxx- (n). 














Corollary 11.3. Keep the notation from the previous lemma. Assume f is normalized so 
that cı = 1. Then 


Cmen = Cmn if (m,n) =1, 


2k-1 o 


prt 


CpCpr = Cpn+1 + p 


where p is prime and n > 1. 


Proof. The coefficient cn is equal to the eigenvalue of T(n) on Mg(T(1)). Obviously 
CmCn is the eigenvalue of T (n)T (m) on the same space. Now we apply assertion (ii) tak- 
ing into account that the correspondence Ry acts as multiplication by p 2* and remember 
that we have introduced the factor n?*—! in the definition of the operator T(n). 














Example 11.2. Let Ex(7) be the Eisenstein modular form of weight 2k, k > 2. We have 
seen in (6.21) that its Fourier coefficients are equal to 


2(27)* oop_1(n) 
pe - 





Ch n >l, 


22k-1qk B, 
co = 2¢(k) = Eo 


Thus cn = C1021 (N), and therefore E;,(7) is a simultaneous eigenvalue of all the Hecke 
operators. 


Corollary 11.4. 


F2~-1(M)ooK~-1(M) = Go~-1(mn) if (m,n) = 1, 


fr) ppr A=) 


T2k—1(P)o2k-1 (p) = O24—1(p O24-1(P 


’ 


where p is prime and n > 1. 
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Example 11.3. Let f = A. Since f spans the space of cusp forms of weight 6 and this 
space is T(n)-invariant for all n, we obtain that f is a simultaneous eigenfunction for all 
the Hecke operators. We have 


We see that the Ramanujan function n — 7 (n) satisfies 
T(m)r(n) = 7(mn) if (m,n) = 1, (11.18) 


t(p)t(p") = T(p"*!) + p''r(p"") ifpis prime and n > 1, (11.19) 


Recall from Number Theory that a function f : N —> C is called multiplicative if 
f(mn) = f(m)f(n) if (m,n) = 1. It follows from above that the Fourier coefficients 
Cn of any modular form which is a simultaneous eigenfunction of all the Hecke operators 
and normalized with the condition that cı = 1 define a multiplicative function. Example 
2 provides the function c2g—1(n). Of course, the fact that is multiplicative is well-known 
and can be found in any text-book in number theory. The fact that the Ramanujan function 
is multiplicative is not easy, and does not follow immediately from its definition. 


11.5 One can say more about the Fourier coefficients of a cuspidal modular form which 
is a simultaneous eigenfunction of Hecke operators. This is done by introducing an inner 
product in the space M(I'(1))?. 


Definition. Let f, g be two parabolic modular forms of weight k with respect to T. Let 
D CH be the modular figure. The formula 


=5 [fora g(7)Im (r)?*- žara = | f(a + iy)g(a + iy)y t ?drdy 


defines a Hermitian inner product in the space Mg(T)?. It is called the Petersson inner 
product . 


Observe that the integral converges because at the cusps f (T)g(T) behaves like O(e~Y) 
for some c > 0. This is why we have to restrict ourselves to parabolic forms only. 


Lemma 11.4. For any A = (2°) € GL(2,R) with det A > 0, 


(FlkA, g9kA) = (f, 9). 
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Proof. We have 





(FhAalea) = 5 f FAT) Cer + d)-*o(Ary(er + d) Im (r) -arar = 


; | f(Ar)g(Ar)ler + d|~**Im (7)"*~?drd? = 
D 


5 J FADIATIm (Ar)**-4a( Arya) = 


t T)g(T)Im (7)? -?drd7. 
5 Ji [Om (ara 


In particular, when we take A € T we get that in the definition of the inner product we can 
integrate over A(D) which is another fundamental domain for T. In fact, this computation 
shows that for any measurable subset Q of H and any A € I, we have 


i) f(r)g(r)im (r)? -?drd7 = i f(r)g(r)Im (7)?*-2drdz. 
D A(D) 
This allows one to view (f, g) as the integral of the differential form 

w = sf (r)g(r)im (r)? 2 drdt 


over H/T. Since for any A € GL(2, R) with det A > 0, the set A(D) is another funda- 
mental domain for I’, the see that the last integral in (11.14) is also equal to the integral of 
w over H/T. Hence, it is equal to (f, g). 














Theorem 11.2. The Hecke operators are Hermitian operators on the space M;,(T(1))° 


with respect to the Petersson inner product. 
Proof. We have to check that 
(T(n)f,9) = (f,T(n)9). 


In view of Lemma 2 it is enough to check it when n = p is prime. We have 


(T(p)f,9) = So Glo = So gA. 


AEA, AEA! 


Note that for any A € A; we have pA~! is an integral matrix of determinant p. Thus we 
can write as MA’ for some M € I'(1) and A € A}. This gives us that 


(T(P)f,9) = >> (fgleM A’) = So (frale’) =F TO 


AEA, A'EA, 
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Corollary 11.5. The space of parabolic modular forms M(T (1)? admits an orthonor- 
mal basis which consists of eigenfunctions of all the Hecke operators T(n). 


Proof. This follows from a well-known fact in linear algebra: a finite-dimensional Hilbert 
space admits an orthonormal basis of eigenvalues of any set of commuting normal opera- 
tors. 














Corollary 11.6. Let f be a cuspidal modular form which is a simultaneous eigenfunction 
for Hecke operators and let cn be its Fourier coefficients. Then Cp/ci are totally real 
algebraic numbers. 


Proof. The numbers c,,/c; are eigenvalues of a Hermitian operator. They must be real. 
To prove the algebraicity, let us consider the set M;,(Z) of modular form of weight k 
for ['(1) with integral Fourier coefficients. Examples of such forms are the normalized 
Eisenstein series Ey = -> ER (7). This set is a Z-module and invariant with respect 
to Hecke operators (as it follows from the formula for the Fourier coefficients of trans- 
formed functions). We can find a basis in this module which is a subset of monomials 
(E3)*(E%)°,a +b = 2k. Thus the egenvalues of T (n) being the roots of the characteris- 


tic polynomial with integer coefficients must be algebraic numbers. 














Exercises 


11.1 Let S be the set of finite-dimensional vector spaces over a finite field F} of q ele- 
ments. For each positive integer n consider the correspondence T(n) = {(V,W) :W c 
V,dim V/W = n}. Show that the operators T(n) generate a commutative subring of 
the ring of correspondences Corr(S). Show that T(n)T'(m) = k(n,m)T(n +m), where 
k(n,m) = #G(n,n+m)(F,) (G(n,n+m) is the Grassmann variety of linear subspaces 
of dimension n in Fg. 





11.2 Show that the Hecke operators T(n) together with operators Re generate a commu- 
tative algebra H over C which is freely generated by the operators T (p) and Rp, where p 
is prime. The algebra H is called the Hecke algebra of the group T (1). 

11.3 Show that the vector subspace of Corr(£)Q spanned by the Hecke operators T(n) is 
a subalgebra of Corr(L)g. 


11.4 Consider the formal infinite series X7} T(n)n~* with coefficients in the Hecke 
algebra H of I'(1). Show that 


X Tinj = J] -Tipp + Rp]. 
n=1 


p prime 
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11.5 Show that for any lattice L in C and a complex number s with Re s > 1, we have 
do #T(n)(L)n™ = 6(s)¢(s — 1), 
n=1 


where ¢(s) is the Riemann zeta function. 

Let I be a subgroup of finite index in (1) and A be a subsemigroup of the group 
GL(2, Q)* of rational 2 x 2-matrices with positive determinant which contains I and 
satisfies the property that, for any a € A, a -T -at AT is of finite index in T (e.g. 
T = [(1) and A = {o € M2(Z) : deto > 0}. Let H(T, A) be the free abelian group 
with the basis formed by the double cosets [o] = ToT, o € A. 


(i) Show that, for any o € A, the double coset [ø] is equal to a finite union of right 
cosets ['o;, where o; € A. 


Gi) If [o] = Uieroi, [o’] = Ul jesa}, let cf ,, denote the number of pairs (i, j) € 
I x J such that Tojo; = Ta for a fixed a € A. Show that the formula 


lb] l= So Bala 
a:TalCrolo'T 


togeher with the addition law defines a stucture of an associative ring on H(T, A). 
This ring is called the Hecke ring of (T, A). 


(iii) Let v be the adjugation involution in M2(Z) (i.e. (M)M = det(M)J2).Assume 
that A is invariant with respect to 1. Show that H (T, A) is commutative if and only 
if [(o)] = [o] for any o € A. 


11.7 Let S be the set of right cosets I - 0,0 € A. For any o € A set Zo = {(Ta, T8) € 
SxS:T-8BCcCToTa}. 


(i) Show that Z, depends only on the double coset |o] of o, so we can denote it by 
Zio): 


(iii) Show that [o] + Zj,) defines a homomorphism of the Hecke ring H (T, A) to the 
ring Corr‘ (S) of finite correspondences on the set S. 


11.8 For any o € A let T'o = (oro!) AT. Let m : H/T, —> H/T correspond to 
the natural inclusion T'ẹ C T and let 7, : H/T, — H/T be the composition of an 
isomorphism H/o~'To ~ H/T induced by the Méebius transformation T —> o - T and 
the natural projection map H/T, > H/o7!To. 


(i) Show that the composition of the correspondences 7 o 75! defines a finite corre- 
spondence C, on H/T. Here 75" is defined as in Example 1 from the lecture. 
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(ii) Show that Co depends only on the double coset ToT. Denote it by Cio). 


(iii) Show that Zio] —> Cjo] defines a homomorphism from the Hecke ring H (T, A) to 
the ring Corr(H/T). 


11.9 Consider the Hecke ring H(T(1), M2(Z)*). For any pair of positive integers a,b 
with a|b denote by T (a,b) the double coset of the matrix (4°). For any positive integer 


n set T(n) = X ap-n 1 (a,b). 
(i) Show that T (a, b)T(a’, b’) = T(aa’, bb’) if (b, b') = 1. 
(ii) Show that T(p*, p™) = T (p, p)T(p*—!, p™—1), where p is prime. 


(iii) Show that there exists an isomorphism of algebras H (T (1), M2(Z)*) @ C and the 
Hecke algebra H of T (1) as defined in Exercise 11.2. Under this isomorphism each 
element T(n) is mapped to the Hecke operator T(n), and each element T (a, a) is 
mapped to the operator Ra. 


11.10 Let A(N) C Mo(Z)* be the set of integral matrices with positive determinant 
prime to N. Prove that the map [(N)oI(N) — T(1)oT(1) defines an isomorphism 
from H(I(N), A(N)) onto H([(1), M2(Z)o*). 

11.11 Let N > 1 and A be a fixed subgroup of (Z/NZ)*. Let A be the semigroup 
of matrices ø = (28) € Ma(Z)* such that (det o, N) = 1, N|c and the image of a 
in Z/NZ belongs to A. Let I be the group of invertible elements in A. For example, 
T = To(N) or P(N). Consider the Hecke ring H(T, A). For any d € (Z/NZ)* let og 
denote any representative of (g soit) in SL(2,Z). For any pair of positive integers a,b 
such that a|b and (b, N) = 1, denote by T (a, b) the double coset of the matrix (9). For 
any positive integer n let T(n) be the sum of the double cosets Zio} Where deto = n. 


Show that 


(i) any Zo € H(T, A) can be uniquely expressed as the product T(m)T (a, b), where 
each prime factor of m divides N (we write it as m| N°); 


(ii) if (m,n) = 1 or m| N” or n| N°, then T(mn) = T(m)T (n); 


(iii) A(T, A 
and T (p 


is a polynomial ring over Z in the variables T (p, p) for all primes p {N 
for all prime p; 


Ne 


(iv) H(T,A 


NS 


® Q is generated as an algebra over Q by T(n) for all n; 


(v) the map H(I'(1), M2(Z)*) > H/(T, A) defined by sending T(p) to T(p) if p is 
any prime, T'(p,p) to Tp, p) if p is a prime with p AN, and sending Tp, p) to 
zero if p is prime with p|N, is a surjective homomorphism of rings; 
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(vi) H(T, A) is a commutative ring. 


11.12 Define the action of [o] € H(T, A) on M;,(L) by fllo] = det(o)* 7! 5; fleo: 
where [o] = U;T'o; and f|,0; is defined as in (6.5) (which applies to not necessary uni- 
modular matrices). 


(i) Show that, extending by linearity, this defines a linear representation T — T* of 
the ring H(T, A) in Mg(T) and in M;(T)?°. 


(ii) Let (T, A) be as in Exercise 11.11. Show that, for any n > 0 and f E€ M;(T), 


T= E Flaca J 


ad=n,(a,N)=1,0<b<d 


11.13 Let us identify the set of points of Xo(V)! = H/T (NV) with the set of isomorphism 
classes of pairs (E, H), where F is an elliptic curve and H is acyclic subgroup of order N 
of its group of N-torsion points (see Theorem 8.6). Let p be a prime number not dividing 
N and let Tp) be the Hecke correspondence on Xo(N Y (see Exercise 11.7). Show that 
T(p)((E,H)) = {(E/A;, Ai + H/A;),i = 0,...,p}, where Ao,..., Ap is the set of 
cyclic subgroups of order p in E. 


Lecture 12 


Dirichlet Series 


12.1 A Dirichler series is an infinite series of the form 


o0 
an 


ns’ 
n=1 


where s is a complex number. It absolutely converges for Re s > 1 + c, where 
an = O(n°). 


An absolutely convergent Dirichlet series in a domain D is a holomorphic function in D. 
The most notorious example of a Dirichlet series is the Riemann zeta function 


Si 
C(s) = a 


n=1 


It converges for Re s > 1. We will be interested in Dirichlet series for which the coeffi- 
cients aņ are the Fourier coefficients of a modular form. 
Let f e€ M(P), and let 


f = De ane? /h (12.1) 


n=0 


be its Fourier series at oo. For any complex number s we define the formal expression 


OO OO 
Z f(s) = `> a = > age Pee (12.2) 


n=1 n=1 
and call it the Dirichlet series associated to f. Let us first invesigate the convergence of 


this series. 
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Lemma 12.1. Let f € Mg(T). Then f is parabolic if and only if 
|f(a + iy)| < Cy* (12.3) 
for some constant C independent of x. 


Proof. Let (a + iy) = |f(a + iy)|y*. It is immediately seen that this function is T- 
invariant. Let a be a representative of a cusp with respect to IT. Choose A € ['(1) such 
that A - a =o. Then f|,4 = ®(e?7'7/") for some function ® holomorphic in a domain 
Re r > c. We also have $(A- (x + iy)) = |®(e2™*-¥)/")|yk_ Assume f vanishes at 
a. Then & = €27?—v)/hGo, where limy so Po # 0. Thus limy; (A+ (x + iy)) = 
limy—+o0 e-?™¥yk = 0. This implies that the function ¢(a + iy) converges to zero when 
T = x+y converges to a cusp. Hence it is a continuous function on a compact topological 
space H*/T. It must be bounded. Conversely, if the inequality (12.3) holds, then ¢(a+7y) 
must be bounded and hence ® must be vanishing at 0. 














Corollary 12.1. Let f € M,(T)? and an be the coefficient at e27'"/" in its Fourier 
expansion at oo. Then 


|an| = O(n”). 
In particular, Z+(s) converges for Re s > k +1. 


Proof. Let q = e27'(*+9)/h, Fix y and let x vary from 0 to h. Then q moves along the 
circle C(y) of radius e~?7y/h with center at 0. By Cauchy’s residue formula 


1 


an = 5 
271 


1 f” , 
[fords = | Fet iyan. 
C(y) h Jo 
By Lemma 12.1, | f(a + iy)| < Cy7* for some constant C. We have 


1 hes ee ae -2k ,2nn 
janl <5 f IFE Hio) ae < Cy Perm 














Taking y = 1/n, we get |an| < Mnë. 


12.2 We shall now find a functional equation for the Dirichlet series Z;(s). 


Lemma 12.2. Let f € M,(T) and Fy = ee a J; Assume that 1! = Fy! -T - Fy 


is a subgroup of finite index in SL(2, Z). Then 
Wn (f) := flkFn = F(-1/NT)N "r © MT’). 


Moreover, if f € Mg (T)?, then Wy (f) € Mg (T. 
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Proof. For any A € I” we have Fy A = BF for some B € T. Hence 
Wn(f)lkA = (fleFn)|zA = flaFn A = 


flkBEn = (f|kB)|kFn = f|kFn = Wn(f). 


We leave the proof of the last assertion to the reader. 














Example 12.1. Let f € My(To(n)). Assume that N|n. Then 
Wyn (f) = f(-1/Nr)N7*r-** € Mi(To(N)). 


To see this we use that 


(le SVC te WA) = (ay Tenn 
(12.4) 


The same equality shows that 
fe Mk (n)) = Wn(f) € ME (n/N) AT o(nN)). 


Theorem 12.1. (Erich Hecke) Let f € M,(T)? and let g = Wy(f). Assume that 
Fy! -T - Fy is a subgroup of finite index in SL(2,Z). Let h be the index of the cusp 
oo of T and h’ be the same for 1’. The Dirichlet series Z+(s) can be extended to a 
holomorphic function on the whole complex plane. Setting 


R(s, f) = N*?(2n)*T(s)Z5(s), 
we have the functional equation 
h° R(s, f) = (—1)*R™* Rk — s; 9), 
Here T (s) is the Gamma-function. 


Proof. We shall use the Mellin transform which carries a function ¢(y) defined on the 
positive ray of real numbers, and bounded at 0 and oo, to a holomorphic function M o(s) 
defined by Mọ = F, where 


F(s) = i o(y)y® ‘dy. 


It is inverted by 
yt+too 


oy) = = | F(s)y *ds, y>0. 
y 
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Take @(y) = f (iy) and let f = $`] ane?™"”7/" be its Fourier expansion at oo. We have 


OO 
co 
=>) ty fe y = 
n=l 0 


oO 


an fe —tys-1 ae = (h/2n)T(s)Z;(s). 
n= 0 


Here we have used the integral formula for the Gamma-function: 


co 


= | er ldt. (12.5) 
0 
We leave to the reader to justify the possibility of the term-by-term integration of the 


infinite series (we have to use Lemma 12.2). Now let us do the same for the function 
g = Wyn (f) € Mg(I’)°, where I’ = Fy tT Fy. We have 


A leo) 
=| Hone va = | stented 


where the first summand converges for Re s > k + 1 and the second one converges 
everywhere. The Fricke transformation transforms f (iy) to f(i/Ny) = N*(iy)?*g(iy). 
So changing the variable y to 1/Ny we obtain 


A oo 
[a iy)y w= jn i/Ny)N yt dy = (DENE | gliu) dy. 
0 A 
(12.6) 
This converges for all s € C. Similarly, 

oo A 

E iy)y®™ dy = (DENE | glij- “dy, 

A 0 


This converges for Re s > k + 1. This shows that each summand in (12.6) can be 
holomorphically extended to the whole complex plane. After summing up we get 


M6(s) = (h/2)*T(s)Z p(s) = (—1)"N* *Mg(iy) (2k — s) = 


(—1)*N*-*(h! /20)?*-*T (2k — 8)Z, (2k — 8). 


147 


Thus if we set R(s, f) = N*/?(2m)~*P(s)Z(s) we obtain 
h°R(s, f) = (—1)*h’* ROR — s, 9) 











forRes>k+1. 





It follows from Example 1 that the Fricke transformation Fy defines a linear operator 
Wy on the space Mg(To(N)) . It satisfies 


We =1. 
In fact we have 
WACS) = Wn( NET f(-1/N7)) = Nr -*N-K(—1/N7)™* f(r) = f(r). 
Thus we can decompose M;(T'o(NV)) into the direct sum of two eigensubspaces 
Mi(Vo(N)) = Ma(Po(N))+ 6 Mk (To(N))- 


with eigenvalue +1 or —1. Similarly, we see that Wy acts on the space M;([(NV)) and 
we can decompose it in the direct sum of two eigensubspaces: 


Mx(E(N)) = Mk (T(N))+ 8 Me (T(N))-. 
Corollary 12.2. Let f E€ Mk (To(N))e where e = +1. Then 
R(s; f) = (-1)*eR(2k — s; f). 
Corollary 12.3. Let f € Mk (T(N))e where e = +1. Then 
R(s; f) = (-1)*N** **eRQk — 5; f). 








12.3 If f € M,(T) is not a parabolic modular form we cannot, in general, attach the 
Dirichlet series to it. However, if we assume that f admits a Fourier expansion at oo with 
coefficients satifying |an| < n° we can still do it and obtain a holomorphic function Z (s) 
defined for Re s > c. The next theorem generalizes the previous theorem to this case. 


Theorem 12.2. Let f € M;(I) and g = Wn(f) € M;,(I’) where I’ = Fy! -T - Fy is 
a subgroup of finite index in SL(2, Z). Let 


oo oo 
: ta Nia 
f= X ane R, g= X er 


be the Fourier expansions at f and g at oo. Assume that |an], |bn| < O(n°). Let 
R(f;s) = N*/2(Qn)—8T'(s)Z(s). Then Z+(s) is holomorphic for Re s > c + 1 and 
R(f;s) + ags~! + (—1)*bo(2k — s)~! admits a holomorphic extension to the whole 
complex plane. Moreover, 


hê R(f; s) = (~1)¥ W? Rf |p Fy; 2k — 8). 
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Remark 12.1. It is known that the Gamma-function T (s) is meromorphic and has a simple 
pole at s = 0. Thus, in Theorem 12.2, Zs(s) admits a meromorphic extension to the 
complex plane with single pole at 2k. 


Example 12.2. Take f(T) = Eə%(T). Then 


a 
iy) =2c0n 4 22 Aer D onal i(n)q”, 


n=1 


2k—1 
O24-1(N 2 


where 


It is easy to see that 


for some positive constant A. Thus Z;(s) is defined and is convergent for Re s > 2k. 


Since a = 
X oaa (n)n = y r-i (lm) = 
n=1 m,l=1 
XO mt-st! = ¢(s)¢(s — 2k + 1), 
m,l=1 
we have | jat 
2(2ri 


Recall that Eo(T) € M (T (1) = M;(To(1)). Applying Theorem 12.2, we obtain 
2k—2s T (2k — 8) 
Ts) 


Of course it follows also from the known functional equation for the Riemann zeta func- 
tion 


C(s)¢(s — 2k + 1) = (27) 





¢(2k — s)¢(1 — s). 


—)¢(1 =). 


Example 12.3. Take f(r) = @(0,7)°*. We know that these functions are modular forms 
of weight k = 2t for T (2). We have 





m3D(5)¢(s) = nF T( 


oo 
= y cgi(n)e"”, 
n=0 


where 
csin) = #{(r1,... rgt) € Z : n= r? AEE = r}. 
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It is clear that we can bound cg;(n) by the number of integer points inside of the cube 
[—V/n, —\/n]**. This easily gives 


csn) < Cn* = Cn?*. 
Therefore, the Z+(s) is convergent for Re s > 2k + 1. We have 


oo cer(n) 1 _ 1 
BO De i Pan EE a OOP 


m=1 (r1, rgt) EZSt\ {0} i "et AEA\{O} 











where Q = z? +... + ree and A = Z% C R*. More generally, for any positive definite 
quadratic form Q : R” > R and a lattice A in R” we can define the Epstein zeta function 


1 
AE A\ {0} 
Although f(T) is not a modular form for T (1) it satisfies f(—1/r) = f(7)r**. Applying 
Corollary 2 to Theorem 1 with N = 2 we get 


At—s 


T (Qn) “#+ST(4t — s)Z;(4t — $) 





2928/2 (22) T (s) Zp (s) = 2**-82 


which gives 
gét 2s T(s) 
2%t-s T(4t — s) 





Z 7 (At s) _ Z (8). 


12.4 Now let us look at the Dirichlet series associated to cuspidal forms which are 
simultaneous eigenfunctions of Hecke operators. 


Theorem 12.3. Let f be a normalized cuspidal modular form of weight k with respect 
to T (1) and ` cnq” be its Fourier expansion. Assume f is normalized in the sense that 
cı = 1. Assume that f is an eigenfunction for all the Hecke operators. Then the associated 
Dirichlet series Zs(s) admits the following infinite product expansion: 





1 
Z5(8) = lI (1 — cyp—8 + p21 2s) ° 


p prime 


Proof. We know from Corollary 11.3 that the function n — cn is a multiplicative func- 
tion. This implies that for any finite set S of prime numbers 


co 
z 1 
> - =, Wo Cpmp ™) = Il (1 — cyp-* + p21 ps)’ 


neEN(S) pES m=0 pes 
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where N(S) denotes the set of natural numbers whose prime decomposition involves only 
numbers from S. Here we use Corollary 11.3 which gives us that 


CO 
(1 — cpp? + pp) S$ qmp™*) = 1. 
m=0 


When S' grows, the left-hand side tends to Z(s). This implies that the infinite products 
converges to Z f(s). 














Example 12.4. Take f = A to obtain 








= = T(n) 1 
Za =) ns II (L—7(p)p-? + pp")? 


p prime 


where T(n) is the Ramanujan function. Applying Corollary 2 with N = 1, we get also 
the functional equation for Z4 (s): 


T (s5) 


Za (12 — s) = On a 


Za (s). 
Remark 12.2. Let 
sp =1- oT +p*T? = (1— aT)(1 — T). 


We know that a, and a, are algebraic integers. The Petersson conjecture suggested that 
— 


Qp = Mp, or, equivalently, 
lop] = op] = p=, 
or 
lepl < 22, 
or 


|En] < ně- oo(n) forall n > 1. 


This was proven by P. Deligne as a special case of his proof of Weil’s conjectures about 
the zeta function of algebraic varieties. In particular, when k = 6 we get the Ramanujan’s 
Conjecture: 


I7(p)| < 2p. 
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12.5 In this section we generalize some of the previous results to the case when ['(1) 
is replaced with T1 (N). We will be rather sketchy and refer for the details to [Seminar]. 
We use the definition of the corresponding Hecke ring H (T, A) from Exercise 11.11. Let 
us denote it by Ty. It is generated by the elements T(p) for all prime p and elements 
T (p, p) for all primes p not dividing N. Let TO) denote the subring of Ty generated by 
T(p) and Tp, p), where p does not divide N. One can extend the proof of Theorem 11.2 
to show that TO?) acts in the space M, (T1(N))? by Hermitian operators (with respect to 
the Petersson inner product). This is not true for the ring Ty. So a cuspidal form could 
be a simulateneous eigenfunction for all the Hecke operators coming from T? but not 
an eigenfunction for some Hecke operator from T y. 

It is easy to see that T(N) is a normal subgroup of To(N) with the quotient group 
isomorphic to (Z/NZ)*. Thue latter group acts naturally on the algebra of modular forms 
with respect to r'o(N), and for each k > 0 we have a direct sum decomposition into the 
eigensubspaces corresponding to Dirichlet characters x : (Z/NZ)* — C*: 


Mi(Ti(N)) = OyxMi (Ti) x; (12.8) 
Let 


M(To(N);x) = {f E€ Me(Po(N)): flag = x of, Yg €To(N)}, 


where x’ is the composition of x with the homomorphism To(N) — (Z/NZ)* which 
sends a matrix to the residue modulo N of its first coefficient. We will also need the 
notation 


Myx (To (N); xX)? = Ma(To(N); x) N Ma(To(N))?. 
We have 
Mi (Ti(N)y = Mk (To(N); xX). 


Clearly the subspace Mg(To(N) C Mg(T1(N) corresponds to the trivial character. 

More explicitly, the action of (Z/NZ)* on M;(T1(N)) is defined as follows. For 
any n € (Z/NZ)*, let an be any element of SL(2, Z) such that an = (7 ae ) modulo 
N. Then the action of n on M;,(T1 (JV) is given by the formula 


<n >k: f > flran. (12.9) 


Notice that the Hecke operator T (n, n) acts on Mg(T1(N)) as n*-? < n >. 
We have the following analogue of Theorem 11.2: 


Theorem 12.4. Let T(n) € T°) with (n, N) = 1. For any f, g E€ Mg(To(N); X), 
(T(n) f, 9) = x(n) (F, T(n)9), 


where the inner product is the Petersson inner product. In other words, the adjoint of 
T(n) is Tao < n mod N >. 
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It is easy to see that the operators T(m), (m, N) = 1 and < n > forma set of com- 
muting normal operators on M;,(['1(V)). This allows to decompose each M;(T'o(V); x) 
into an orthogonal sum of TA )-eigensubspaces. 

The condition (n, N) = 1 is important. The operators T(n) for which n does not 
satisfy this condition are not normal operators. So, it becomes problematic to find a 
modular form which is a simultaneous eigenfunction for all the Hecke operators. 

Another unfortunate thing is that the operator Wy does not commute with all the 
Hecke operators, so that we cannot combine Theorem 12.3 and Corollary 12.3 to ob- 
tain Dirichlet series Z+(s) with the infinite product as in Theorem 12.3 which satisfy the 
functional equation as in Corollary 12.3. 

We have the following weaker assertion: 


Proposition 12.1. Let Wy be the operator on Mg(T1(N))? corresponding to the Fricke 
transformation Fy defined by f > f|k fe NO rie Let T(n)k x denote the restriction of the 
Hecke operator T (n), (n, N) = 1 to the subspace M;(To(N); x). Then 


T(n)k,x ° Wn = x(n)Ww ° T(n)e,x, 
where X denotes the complex conjugate character. 
Proof. We refer for the proof to [Shimura]. LJ 


However, one can still find common eigenvalues in M;,(['1(N))° for all the Hecke 
operators if we restrict these operators to a certain subspace. Let us explain this. 
Let d, M be positive integers such that dM |N. There exists an injective linear map 


tam, N : Mg (01(M))° > Mz (T1(M))°. (12.10) 


It is defined by sending f(T) to d27! f(dr). One checks that it is a homomorphism 
of T(’)-modules. Let M,(1(M))°), be the subspace of M;(I'1(M))° spanned by 
the images of the maps za,m,n. Let Mp(T1ı(M))}eu be the orthogonal complement of 
Mi; (1(M))°,, with respect to the Petersson inner product. In fact, we have an orthogo- 
nal decompositions 


Mx(Pi(M)) oa = ®xMa(Po(M); x)oas 
where Mg(To(M); x) og = Mg (To(M); X)? O Mg(To(M))? 4, as well as 
Mk (To (M); X)? = Mk To (M); Xora ® Mk (ToM); new: 
The next result, due to Atkin and Lehler, is called the Multiplicity One Theorem. 


Theorem 12.5. Let f € M (T1(N))L eu. Suppose that f is an eingefunction for all the 
Hecke operators from TOP), for some D > 0. If g is another such form with the same 
eiegenvalues, then g is a scalar multiple of f. 
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Corollary 12.4. Let f € M;(Ti(N))°..,. The following assertions are equivalent: 
(i) f is an eigenfunction for TAP) for some D > 0; 
(ii) f is an eigenfunction for T); 
(iii) f is an eigenfunction for Ty . 


Proof. It follows from the theorem that each TOY )-eigensubspace in Mg (T1 (N))} ou 
is one-dimensional, and hence is T y-invariant because all the Hecke operators commute 
(Exercise 11.11 (vi)). This shows that (i) implies (iii). The rest of implications are obvi- 


ous. 














Remark 12.3. Let f = $` anq” be the Fourier expansion of a f € M;(Ti(N))°.,, 
satisfying one of the equivalent conditions of the previous corollary. One can show that 
a, Æ 0 so we can alaways normalize f to assume a, = 1. Such a modular form is called 


a newform. 


So we can extend Theorem 11.2 to newforms. To see when newforms exist we ob- 
serve that the maps zg,7,n send Mg(To(M); x)° to Mx (To(N); x’)°, where x is the 
composition of x : (Z/NZ) — C* with the natural surjection (Z/NZ) —> (Z/MZ). So, 
if x is a primitive character of (Z/NZ), we have 


M; (To(M); X)? = Mg (To(M); x) pew: 





We can apply Corollary 12.3 to get a functional equation for newforms. Notice that 


the space M (T1(N))} ou is invariant with respect to the operator Wy. This follows from 


the Wy-invariance of the space Mp(T1(N))®}q- The latter is easy to check. We have, for 
any f € M;(T1(M))° such that N = dM, 


Wam (ta,m,n(f(7)) = Wy(d2~! f(dr)) = 
(dM)~*d2—}(7)~2* f(—1/Mr) = damn (Wulf). 2.11) 


This checks the claim. 
It is also easy to see that 


Wy (Mg (T(N); X) pew) = Ma(Fo(N); X) pew: 


In particular, we can decompose M;(I'9(V))? 


new ito a direct sum of eigensubspaces of 
Wy: 


Mi (Fo(N)) pew => Mgl T(N) pew, + ® Mi (Fi(N))pew,—- 


An element of each space will satisfy the functional equation from Corollary 12.3 and 
also will admit the infinite product decomposition from Theorem 12.3. 
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Exercises 


12.1 Show that the Mellin transform of the function f(x) = V(0;ix) — 1 is equal to 
2n-*C(2s)T(s). 

12.2 Show that the Dirichlet series }*°°_, ann~* can be expressed as the Laplace trans- 
form fo f(t)e~*‘dt for an appropriate function f(t). 

12.3 Find the functional equation for Z; where f(T) = A(117)/A(r)!/1? (see Exercise 
10.6). 

12.4 Show that E2(T) — pE2(pr) belongs to Mı(To(p)), where p is prime. 

12.5 Prove Theorem 12.2. 


12.6 Apply the proof of Theorem 1 to the function f(T) = Voo(0; 7) to obtain the func- 
tional equation for the Riemann zeta function. 


12.7 Prove that for any f = $` anq” € Mg(T(1)) one has |an| < O(n?*-1). 
12.8 Show that the discriminant modular form A € Me(T) c Me(To(N)) is an eigen- 
function for all Hecke operators from T'y but not for all Hecke operators from T y (unless 
N = 1). 
12.9 Describe the decomposition of Mı (T1(33))? into the old and new subspaces by 
verifying assertions (i)-(iii) below. 

(i) dim.M (I, (33))° = 21, dim Mı (T1 (11))° = 1, and M,(T'1(3))° = 0; 

GD dim Mı (T1(33))} a = 2; 


i dim M1(T0(33); x)... = 2 for each nontrivial character x. 


new 


(iv) Show that each M1 (T9(33; y))2..,, is spanned by T33-eigenfunctions. 


new 


Lecture 13 


The Shimura-Tanyama-Weil 
Conjecture 


13.1 In the previous lecture we have attached a Dirichlet series to a cuspidal modular 
form with respect to the group ['9(JV). In this lecture we will attach a Dirichlet series to 
an elliptic curve over Q. The conjecture from the title of the lecture tells that the latter 
Dirichlet series always coincides with the former one for an appropriate modular form. 

Let E be an elliptic curve. We assume that it can be given by homogeneous equations 
with coefficients in Q and the set of points of Æ(Q) with rational projective coordinates 
is not empty. We say in this case that E is an elliptic curve over Q. One can show that the 
set E'(Q) is independent of the choice of a system of algebraic equations over Q defining 
E. 


Lemma 13.1. Let E be an elliptic curve over Q. Then E is isomorphic to a plane cubic 
curve with equation 
Y?Z — X? — oX oF =0 (13.1) 


with integer coefficients C2, C3. 


Proof. We use the Riemann-Roch Theorem from Lecture 8. Let D = }> npP bea divisor 
which is a linear combination of points from E(Q). Let L(D)g denote the Q-subspace 
of L(D) which consists of rational functions on E with coefficients in Q. Once can show 
that dimo L(D)g = dime L(D). Fix a point Q € E(Q) and apply the Riemann-Roch 
Theorem to obtain that dimo L(nQ) = n. Let x be a non-constant function in L(2Q) 
and let y € L(3Q) which is not a linear combination of 1 and x. Since the functions 
1,2, x7, x,y, y?, ry belong to the space L(6Q) and the latter is of dimension 6 over Q, 
we obtain a linear relation 


ao + a£ + ar? + azr? + boy + bixy + boy? =0 
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with coefficients in Q. Replacing x with ax+b and y with cy+dx-+e for some appropriate 
coefficients a, b, c, d,e E€ Q we may assume that the linear relation has the form 


b3 + box + x? — y? =0, 


where bo, b1 € Q (see Example 6.4). Multiplying x by a7? and y by a~? for an appro- 
priate integer œ, we can change bz to boa“ and b3 to baf. Choosing an appropriate a 
this makes we can assume that the coeffients cy = bya* and c3 = b3a° to be integers. 
Using the argument from the second half of the proof of Corollary 8.5 we obtain that the 
functions x, y define an isomorphism from E \ {Q} —> C \ {oo}, where C is the plane 
cubic given by the equation (13.1), and oo is its point (X, Y, Z) = (0,1,0). This can be 
extended to an isomorphism E = C. 














Observe that E can be given in many ways by an equation of the form (13.1). We can 
make it almost unique if we require some additional property. Let 


A= 4 + 27¢ (13.2) 


be the discriminant of the polynomial t? + cət + c3. We call it the discriminant of the 
equation (13.1). For every prime p let v,(A) be the highest power of p which divides A. 
We say that the equation (13.1) is a minimal Weierstrass equation of E if for any other 
equation of the form (13.1) defining E with discriminat A’ we have, for any prime p, 


vp(A) < Up(A’) 


One can prove that a minimal Weierstrass equationt always exists and is unique (see [Sil- 
verman)]). 


Definition. Let E be an elliptic curve over Q and let (13.1) be its minimal Weierstrass 
equation with discriminant A. Let p be a prime number. We say 


(a) E has good reduction (resp. bad reduction) modulo p if p JA (resp. p|A), 
(b) E has multiplicative reduction modulo p if p|A but p { c2c3, 
(c) E has an additive reduction modulo p if p|cz and p|c3. 


Let us explain the terminology. Since the coefficients cy and c3 are integers we can 
reduce them modulo p to obtain an algebraic curve over the finite field F,,. This curve is a 
singular curve (i.e. the formal partial derivatives of the polynomial defining the equation 
has a common zero over the algebraic closure F, of Fp) if and only if p|A. If p|cz and 
p|cg the equation over F, becomes Y?Z — X°’ = 0. Its singular point is (0,1, 0), and its 
nonsingular solutions (x, y, 1) over F, are of the form (t?,t),¢ € Fp. The addition law 
in F, defines the addition law on the set of nonsingular solutions equipping this set with 
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the structure of an abelian group isomorphic to the additive group of Fp. Finally, if Æ has 
multiplicative reduction modulo p, then after reducing the coefficients cz and c3 modulo 
p we obtain an algebraic curve over Fp which is isomorphic over Fp to the curve 


Y?Z — (4 67) =0 (13.3) 


with a Æ 0. The point (0,0, 1) is its singular point. Any nonsingular solution over F, 
has the form (to(t? + at2), tı (t? + at), to), where (to, t1) € P'(F,) and t? + at? # 0. 
The linear transformation uo = to + va, u1 = to — va allows one to identify the set of 
nonsingular solutions with the subset P!(F,,) \ {0, o0} = F*. So this set carries a natural 
structure of an abelian group isomorphic to the multiplicative group of the field Fp. 


13.2 Now we are ready to define the L-function L(E, s) of an elliptic curve over Q. It 
is given as an infinite product 


L(E,s) = [| (4.5), (13.4) 
p prime 
where 
(a) if E has a good reduction modulo p 


1 
1—a(p)p-§ +p 





Ly(E, s) = 1—2s’ 


where 
a(p) =p+il=— #E(Fp), 
and E(Fp) = {(x, y, z) € P?(Fp) : y?z = 23 + e282? + c32°}. 


(b) if E has multiplicative reduction modulo p 


L,(E, 8) = me 
where a(p) = 1 if a in (13.3) belongs to F, and A(p) = —1 otherwise. 
(c) if E has additive reduction modulo p 
Lp(s)=1. 


The next lemma shows that L(E, s) is a Dirichlet series. 


Lemma 13.2. The infinite product ||,,(1 — cpp *)~' with |cp| < p® for some real a 
defines an absolutely convergent Dirichlet series for Re s > c+ 1. 
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Proof. Let cn be a multiplicative complex-valued function on N with the value at a prime 
p equal to cp. We have a formal identity 


Sg 1 
NM 





Since |cp| < p%, we have |cn| < n“ for all n. We know from Lecture 12 that this implies 
that the Dirichlet series is absolutely convergent for Re s > c+ 1. 














Corollary 13.1. The infinite product L(E, s) converges for Re s > 2 and is given there 
by an absolutely convergent Dirichlet series. 


Proof. Let ap be the coefficient from the definition of L(E, s). If p is a prime defining a 
bad reduction of E, then |ap| < 1. If p defines a good reduction, then E (Fp) consists of 
the infinity point and a points (x, y, 1), where x,y € F, and y? = z? + cox + c3. This 
gives #E (Fp) < 2p +1 and hence |ap| = |#E(Fp)—p— 1| < p. We can write the factor 
L (E, s) for “good” primes in the form 


1 
(hap) apy 





L,(E,s) = 


where 
1—apX +pX’° = (1— 7X )(1 = fy) 





The roots rp, r, are equal to 5 (ap + /a2 — 4p) and clearly satisfy |rp| < |ap| < p. Thus 
we can write down the infinite product L(E, s) as the product L1(s)L2(s), where each 
factor satisfies the assumption of the previous lemma with c = 1. The assertion follows 


from the lemma. 














In fact, we can do better and prove the convergence of the L-series for Re s > 3. For 
this we invoke the following 


Theorem 13.1. (H.Hasse) In the above notation 
|p +1- #E(Fp)| < 2p. 


Proof. We refer to [Knapp] for an elementary proof of this theorem due to Yu. Manin. 
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13.3 Now we are familiar with two Dirichlet functions both absolutely convergent for 
Re s > 2. One is the Dirichlet series Z/(s) associated to a cusp form f of weight 1 with 
respect to 'g(V) and L(E, s). The next conjecture relates these two functions: 


Conjecture. (Hasse-Weil) Let E be an elliptic curve over Q. Define the conductor of E 
to be 
N=] [p”, 
p 


where p runs in the set of primes for which E has a bad reduction, and ap = 1 if the 
reduction is of multiplicative type, and Ap = 2 otherwise. There exists a unique f € 
Mı(To(N))? such that 

Z p(s) = L(E,s),Res > 2. 


Moreover, f is an eigenvector of all the Hecke operators and also an eigenvector for the 
operator Wy. 


Notice that according to Remark 12.3, the form f must be a newform. Applying 
Corollary 12.2, we obtain the following: 


Corollary 13.2. Assume the above conjecture is true. Then L(E, s) admits a holomorphic 
extension to the entire complex plane and satisfies the following functional equation: 


N? (2r) T(s)L(E, s) = +N F (Qn) 827 (2 — s)L(E,2 — s). 





In fact, the previous conjecture was motivated by this assertion. It turns out that the 
latter corollary is almost equivalent to the Hasse-Weil conjecture. One observes first that 
Z 7(s) satisfies the following additional property. Let 


x:Z—> C 


be a Dirichlet character modulo m. Recall that it means that x(n) = 0 if (n,m) A 1 and 
the induced function on (Z/mZ)* is a homomorphism to C*. We say that x is a primitive 
character if x is not a Dirichlet character modulo any proper divisor of m. Let us modify 
the zeta function Z(s) = $- © associated to a modular form by setting 


Z5(8:x) => a 
n=1 





There is an analog of Corollary 12.2: 


Theorem 13.2. Let f € Mg (To(N)} satifying Wy f = ef. For any primitive Dirichlet 
character x modulo m, where (m, N) = 1, set 


Ry(s;x) = (m? N)®? (2r) T (s)Zp(s; x). 
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Then 
Ry(s3x) = e(—1)*m7 G(X)? X(N) Ry (2k — 83 X). 


Here x denotes the conjugate Dirichlet character defined by x(n) = x(n) and G(m, x) 
is the Gauss sum defined by 


Proof. Let 
METON), X) = {f € MTN) : fle (25) = x0 


Clearly, Mg (To(N), x) C Mk(T1(N)), where 


rı(N)= e 4 ETp(N):a=6=1 modulo N} 
We can apply Theorem 12.1 to any cusp form f € Mp(To(N), x). Now we use the 
following “shift trick”: 


OO OO 
f =X cng” € MeCN); Y) = fx = X x(n)eng” € Ma (Po(M); x74), 
n=1 n=1 
where 4 is a primitive Dirichlet character modulo a divisor s of N, x is a primitive charac- 
ter modulo some number m, and M is the least common multiple of N, m7, and ms. The 
proof of this fact is a straightforward check using some known properties of the Gauss 
sums. Taking w = 1, we obtain that 


Rg (s; x) = R,(s), 


where g € M(T(Nm?); x7). Now we apply Theorem 12.1 to R,(s), previously checking 
that 
Wyma = X(N)G(x)?m™ fy. (13.5) 














Theorem 13.3. (Weil’s Converse Theorem) Let L(s) = X7} Cnn ~* be a Dirichlet series 


n=l 


with |cn| = O(n®) for some a > 0. Let N, k be positive integers and € = +1. Suppose 





(i) the function R(s) = N*/?(2m)~*T(s)L(s) extends to a holomorphic function on 
the entire complex plane, is bounded in every vertical strip, and satisfies the func- 
tional equation 


R(s) = e(—1)* R(2k — s); 
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(ii) for every integer m coprime with N, and every primitive Dirichlet character x 
modulo m, set 


Lys) = D> enx(n)n™ 
n=1 
and assume that the function 
Ry(s) = (m?N)*? (2) T (s) Ly (8) 


extends holomorphically to the entire complex plane, is bounded in every vertical 
strip, and satisfies 


Ry(s) = e(-1)*m7"G(x)?x(N) Ry (2k — 8); 


(iii) the series L(s) converges absolutely at s = 2k — 6 for some 6 > 0. 


Then there exists f € My(Uo(N))° such that 


L(s) = Z}(s). 


We are skipping the proof referring to [Ogg] or [Miyake]. 


13.4 Let us check the Hasse-Weil conjecture in the case when Æ = Xo(N). Using the 
formula for the genus of a modular curve from Lecture 8, it is not difficult to see that NV 
must belong to the set 


{11, 14, 15, 17, 19, 20, 21, 24, 32, 36, 49}. (13.6) 


We shall use the theory of Hecke operators for F = ['9(V). In Lecture 11 we considered 
only the case T = ['(1), so we have to rely on Exercises 11.7-11.9 instead. Let op = 
(2 a where p is a prime number. According to Exercise 11.7, the matrix ap defines a 
correspondence on H/To(N) which we denote by T(p). We can use the same matrix to 
define a Hecke operator on the space of modular forms M;(To(V)) (see Exercise 11.9). 
The following is a simple description of the Hecke correspondences Tp) in the case 
(p, N) = 1. We know that each point of H/To(N) can be interpreted as the isomorphism 
class of a pair (E£, H), where E is an elliptic curve and H is its subgroup of order N. 
Equivalently, the pair (E, H) can be viewed as the pair of numbers (j (E), j(£’)), where 
E! = E/H. Let So, S1,- . . , Sp be the set of subgroups of order p in pE S (Z/pZ)*. We 
have 


Assume p is prime of a good reduction for Xo(V). Let Xo(p) denote the corresponding 
reduction. This is an elliptic curve (= a curve of genus 1) defined over the field Fp. The 
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reduction of the affine part H/To(N) of Xo(N) modulo p is an affine curve Vo(V), over 
Fp. Its points over a field K of characteristic p correspond to isomorphism classes of 
pairs (E, H) as above defined over K. There is one important difference between elliptic 
curves over a field of characteristic 0 and over a field of characteristic p > 0. In the former 
case the group of p-torsion points consists of p? elements. In the latter case, it consists of 
p elements or it is trivial (see Exercise 13.2). So, the degree of the correspondence T (p) 
obtained from T (p) by reduction modulo p must be equal to one. 

In characteristic p > 0 there are regular maps of algebraic varieties which are bijective 
on the set of point but nevertheless are not isomorphisms. An example of such a map is 
the Frobenius map. It is induced by the map of projective space defined by the formula: 


Pie DG ay, > ees ee: 


Let X be a projective algebraic subvariety in P” defined by equations with coefficients in 
a field K of characteristic p > 0. Let X) be the variety whose equations are obtained 
from those of X when its coefficients are raised in p-th power. Then Fp restricts to a 
regular map Fp : X > X (P) of algebraic varieties. In the special case when K = Fp 
we have X = X”) so Fisa map of X to itself. Although it is the identity on the set 
X (Fp) of points with coordinates in F,, it is not the identity on the set X (F,) of points 
with coordinates in the algebraic closure of F,. When X = F is an elliptic curve over 
F, the map F is a homomorphism of groups E(F,) —> E(F,). One can show that the 
endomorphism [p] : x — x? of the group E(F,,) factores through Fp. Let [p] = FY o Fy. 
We have the following: 














Theorem 13.4. (Eichler-Shimura) Let p be a prime of good reduction for Xo( N). Then 
we have the following equality in the ring Corr(Vo(N)p(Fp)): 


T(p) = F, + Py 


Proof. (following [Milne]). We will only sketch it. Let us show that the two corre- 
spondences agree on a certain open subset of points of Vo(N). Consider a point P € 
Vo(N) (Fp) and lift it to a point P’ € Xo(N)(Q), where Q is the algebraic closure of 
Q. The point P’ can be represented as the isomorphism class of a pair (E, H), where 
E is an elliptic curve H is a cyclic subgroup of order N of E(Q). Equivalently, we can 
view this point as an isogeny E — E” with kernel H. The reduction modulo p defines a 
homomorphism ,£(Q) > ,£(F,) whose kernel is a cyclic group Ao of order p. Here we 
assume that F is an ordinary elliptic curve, i.e. »E(F,) is of order p. Let Ag,..., Ap be 
the subgroups of order p of Æ. Then each A;, i # 0 is mapped to the subgroup sf sides p 
in Ē. Let E; denote the reduction modulo p of the elliptic curve E; = E Ai. Let E! be 
the similar notation for the curve Ej. The multiplication map x > px of E factors as 


EE, E. 
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When 2 = 0, the first map is purely inseparable of degree p, and the second map is 
separable of degree p. When i Æ 0 the first map is separable and the second one is 
inseparable, both are of degree p. We have, in both cases, 


EP =~ Ey, EP = Él i>. 
One can show that i 7 NORS 
(BE), BN) = F(E, E’). 


Thus F (P) = (Eo, Ef and F,(ÉEi, EH) = P,i > 0. This implies that T(p) = F, + 
F! 
Pp 














Let E be an elliptic curve defined over a field K of characteristic p > 0. One can show 
that for any prime l Æ p the group jn E(K) of points of order dividing /” defined over the 
algebraic closure K of K is is isomorphic to (Z/I"Z)*. Of course we know this fact when 
K = C. Since for any m > n we have a canonical homomorphism jm E(K) > jn E(K) 
defined by multiplication by /’"~”. Passing to the projective limit we obtain a rank 2 free 
modulle T;(£) over the ring of /-adic numbers Z). It is called the Tate module of E. 

Let a be an endomorphism & of E (= a map of algebraic varieties which induces 
a homomorphism of groups E(K) — E(K)). It defines a homomorphism of groups 
mE(K) > jn E(k). Passing to the projective limit we obtain an endomomorphism of 
the Tate module 


pila) : T(E) > T(E). 
It is called the l-adic representation of a. 
We shall apply this to the case when K = F, and a = Fy is the Frobenius endomor- 
phism. 


Theorem 13.5. Let ap = p + 1 — #E(Fp) and rp,r,, are the roots of the polynomial 


P — apT + T°. Then rp, r, are algebraic integers, and considered as elements of the 


algebraic closure of the field Q; of l-adic numbers they coincide with the eigenvalues of 
the l-adic representation of Fp on T(E). 











Proof. We refer to the proof to [Silverman]. 





Remark 13.1. One should compare this result with the well-known Lefschetz formula 
in topology. If one interprets T,(E) as the first cohomology H! group of E, then the 
Lefschetz formula says that for any map f the set of fixed points of f (i.e. points x 
such that f(x) = x) is equal to the sum X` (—1)Tace( f*| H+). In our situation f is equal 
to the Frobenius map, and its fixed points are obviously the points x = (ao,...,@n) 
satisfying a? = a;, or equivalently x € E(F,). We have Trace(f*|H') is equal to the 
sum of eigenvalues of F, in T(E). Also H° = H? = Z, and Trace(f*|H°) = 1, 
Trace( f*| H?) = p, the degree of the Frobenius map. 
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Now everything is ready to verify the Hasse-Weil conjecture for elliptic modular curve 
Xo(N). Consider the characteristic polynomial of p;( Fp). It is equal to 


P(T) = T° — aT + det(pi(F,)). 


We know that det(p,(Fp)) = rpr{, is an algebraic integer, and by Hasse’s theorem |rp + 


as p'/?. This easily implies that TpT = p. Thus 
P(t) =T? — aT +p. 


Since Fp o F} = p, we see that pı(Fp) + pı(F}) acts on T(E) as the multiplication 
by ap. This implies that Fp + F, is equal to ap as an element of Corr(Vo(N)p). By 
Eichler-Shimura’s Theorem, the Hecke correspondence T (p) = ap. From this we obtain 
that T (p) = ap as a correspondence on Xy/To(N). It follows from Corollary 8.4 that 
dim Mı (To(N))° is one-dimensional. Let f be a non-zero parabolic form from this space 
normalized in such a way that its Fourier expansion is of the form q+)- 7s cng”. Clearly, 
f is an eigenfunction for all the Hecke operators T(n). By Lemma 11.3, T(p)f = cnf. 
Comparing with the above, we obtain cp = ap. Thus the infinite product expansion for 
Z f(s) coincides with the infinite product for L(Xo(N), s), up to a finitely may factors 
corresponding to prime p of bad reduction for Xo(V). Using Weil’s Convese Theorem it 
is not hard to deduce from this that the Dirichlet series of f coincides with the L-series of 
Xo(N). 


13.4 Let E be an elliptic curve over Q and G = Gal(Q/Q) be the Galois group of the 


algebraic closure of Q. It acts naturally on the group of E(Q) of Q-points of Æ. This 
action defines a linear representation of G in the Tate module of E: 


pri: G > GL(T,(E) 8 Qı) = GL(2, Qi). 


Now for any prime number p the group G contains a distinguished element Frob,,, called 
theFrobenius element. It is defined as follows. Let cp € Gal(Q »/Q,) be the pre-image of 
the Frobenius automorphism of the residue field Fp. Choose an embedding Q > Q, and 
define Frob, as the image of op under the inclusion Gal(Q/Q) —> Gal(Q,/Q,). Assume 
E has a good reduction modulo p and p Æ l. Then, one proves that 


pri(Frobp) = pp (Fp), 
where E is the reduction of Æ modulo p. Thus we have 
det(1 — pg„(Frobp)T) = det(1 — pg (Fp)T). 
In particular, if L(s, E) = Z;(s) for some modular form f € M,(To(N))°, then 


det(1 — pxy(Frob,)T) = p — apT + T°, 
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where a, are the Fourier coefficients of f. Here we assume that f is an eigenvector for 
all the Hecke operators and a, = 1. We shall refer to such modular forms as normalized 
eigenforms. 

Now let f € Mz(To(N), x)? be any cuspidal modular form with a Dirichlet character 
which has the previous properties. Let K be an extension of Q generated by the Fourier 
coefficients of f. We know that K is a finite extension. For any finite place A of K let K) 
be the completion of K at À. Deligne constructed a representation 


Pf? Go GL(2, Ky) 
such that for each prime p we have 
pra((Froby) = p — apT + T°. 


This representation is irreducible and is uniquely defined. Conjugating by a matrix from 
GL (2, Ky) we may assume that the matrices defining this representation have coefficients 
in the ring of integers O) of Ky. Reducing them modulo the maximal ideal, we obtain a 
representation 

Pfa: G > GL(2,F), 


where F is a finite field. 


Definition. Let F be a finite field. A representation p : Gal(Q/Q) —> GL(2, F) is called 
a modular representation if it arises from a normalized eigenform f € Mg(To(N), x)° 
for some N, k, and x. 


Note the modular representation has the property that p(c) = —1, where c is the 
complex conjugation automorphism of Q. Representations Gal(Q/Q) > GL(2,F) with 
this property are called odd. 


Conjecture. (J.-P. Serre) Any odd irreducible representation Gal(Q/Q) > GL(2,F) 
is modular unless F is of characteristic p < 3 and p is induced by a character of 


Gal(Q/Q(V—1)) if p = 2 and by a character of Gal(Q/Q(V—3)) if p = 3. 


In fact, Serre gives a conjectural recipe for finding an appropriate (NV,k, x). For 
example, it predicts (N, k, x) for representations arising by reduction modulo p from the 
p-adic representations py associated to an elliptic curve FÆ over Q with whose reductions 
are all either good or of multiplicative type (we say then that E has stable reductions). 
Then N is equal to the product of all primes | 4 p such that the discriminant Ap of E 
has order at / not divisible by p; k = p+ 1 if vp(A pg) is not divisible by p and equals 1 
otherwise; x = 1. 


Theorem 13.6. Serre’s conjecture implies Fermat’s Last Theorem. 
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Proof. Let (a,b,c) be a non-trivial solution of x” + y” = z”. It is known that without 
loss of generality we may assume that n = p > 5 is prime and p does not divide a and b. 
Also we may assume that a = —1 (mod 4) and that b is even. Consider the elliptic curve 
E given by the Weierstrass equation 


y= x(x — a?)(a + bP). 
It can be verified that Æ has semi-stable reductions and 
Ap = —2°(abc)”?. 


In particular p|v,(Az). Consider the representation pgp and it reduction modulo p. It 
can be checked that this representation is irreducible and odd. If Serre’s Conjecture is 
true, then pz, is a modular representation and Serre’s recipe gives N = 2,k = 1,x =1. 
However, Mı (To(2))? = {0}. 














13.5 For the following we shall use the notion of the Jacobian variety of a compact 
Riemann surface X. It is defined as a complex torus J(X) = C9/A, where g is equal to 
the genus of X and A is the lattice in C? spanned by the vectors 


m= (f isso wi), t= Deeg 
yı Y: 


2g 


for some basis w1, ...,wg of the space of holomorphic differentials on X and a basis 
Y1,- --, Y2g Of homology 1l-cycles on X. Fixing a point pọ € X we obtain a natural 
holomorphic map ip : X — J(X) defined by the formula: 


p p 
pf TEN wg) modulo A. 


Po Po 


It is an isomorphism when g = 1. This map extends to a map from the group of divisors 


Div(X) by the formula 
ipo >D npp) = y Npipo (Pp), 


where the addition in J(X) is the addition in the factor group of the additive group of 
CI. By Abel’s theorem this map defines an isomorphism from the group of diviors on X 
modulo linear equivalence onto the group J (X). 

Let Z be a finite holomorphic correspondence on X, i.e. Z is a subvariety of X x X 
defining a finite correspondence on the set of points of X. As we saw in Lecture 11, Z 
defines a homomorphism from Div(X) to itself. It is easy to check that it sends principal 
diviors to principal divisors, and hence defines an endomorphism of the Jacobian variety 
J(X). We shall apply this to the case when X is a modular curve and a correspondence 
is a Hecke correspondence on it. 
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Although we defined the Jacobian variety as a complex torus, one can develope a 
purely algebraic theory for J(X) valid for nonsingular projective curves X defined over 
an arbitrary field K. In this theory J(X) is a projective algebraic variety whose set of 
points J(X)(K') over any extension A’ of K has a natural structure of an abelian group. 
Also, for any point po in X(/‘) there is a regular map ipp : X — J(X) defined over 
the field K. It induces an isomorphism from the group of K-divisors on X modulo linear 
equivalence onto the group of K-points of J(X). There is an analogue of the Tate module 
T,(J(X)) for J(X) and of the l-adic representation of Gal(K /K) in it. 


13.6 We know that the Hasse-Weil conjecture is true for an elliptic curve of the form 
Xo(N). Let E be an elliptic curve over Q, assume that, for some N, there exists a 
nonconstant regular map defined over Q from X(N) to Æ. We say that E is a modular 
elliptic curve or a Weil elliptic curve. 


Theorem 13.7. Let E be a Weil curve. Then it satisfies the Hasse-Weil conjecture. Con- 
versely, if E is an elliptic curve over Q satisfying the Hasse-Weil conjecture, then E is a 
Weil elliptic curve. 


Proof. We shall only sketch a proof. Suppose E satisfies the Hasse-Weil conjecture. 
Then L(E,s) = Z; for some newform f € M;(I'o(M)))?..,. For any prime p not 
dividing Np, the characteristic polynomial of Frob, coincide with respect to the /-adic 
representations pg; and pf. Using the continuity of the /-adic reprsentation and the 
fact that the Frobenius elements form a dense subset in the Galois group G of Q (the 
Chebotarev theorem) we obtain that pz = pr. Now let us consider f as a holomorphic 
differential form on Xo(M). Since f is an eigenfunction for the the Hecke ring T my, we 
have a character 6 : Ty — Q defined by the eigenvalues. Let T be the kernel of 6. The 
Hecke ring acts on Xo(/) via correspondences, and hence acts on its Jacobian variety 
Jo(M) via endomorphisms. Let A = Jo(M)/TJo(M). This an abelian variety and its 
tangent space is naturally isomorphic to Cf. In particular, A is a elliptic curve. Applying 
the Eichler-Shimura theorem, we can show that the characteristic polynomial of Frob, in 
the /-adic representation of A is expressed in terms of the Hecke operators: 


det(p.4,(Frobp) — tl) = t° — O(T(p)t + pO(T(p,p)). 


This allows us to verify that L(E, s) = L(A, s). By a theorem of G. Faltings, the elliptic 
curves E and A are isogeneous over Q, and in particular their conductors are equal. This 
will imply that Ng = M, and there exists a regular map over Q from Jo(N) to E. 
Composing it with an embedding of Xo(NV) in Jo(V) we obtain that E is modular. 

Now assume that E is a Weil elliptic curve and let Xo(N) > E be a regular map 
over Q. The space of holomorphic differential forms on E is one-dimensional over C. 
By constructing a certain “Neron model” of EF over Z one produces a certain 1-form, 
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whose pre-image on Xo( N) is a holomorphic differential form such that, after identifying 
it with a cusp form f of weight 1, its Fourier coefficients at infinity are rational numbers. 
Again by the Eichler-Shimura theorem one can check that f is an TU )-eigenform with 
eigenvalues Àp of T (p) satisfying \, = p + 1 — #E (Fp) for all prime p not dividing 
N. Projecing it to the subspace of M1 (To(N))} c we find a newform f. Applying some 


results of Deligne-Langlands-Carayol one can show that L(E, s) = Z+(s). 














We now see that the Hasse-Weil conjecture is equivalent to the following: 
Conjecture. (Shimura-Taniyama-Weil) an elliptic curve over Q is a Weil elliptic curve. 


We have seen already that Serre’s Conjecture implies Fermat’s Last Theorem. It was 
shown by K. Ribet and B. Mazur, that the fact that the elliptic curve used for the proof of 
Ferma is modular implies the Ferma Theorem. Let us sketch the proof of the following: 


Theorem 13.8. The Shimura-Taniyama-Weil conjecture implies Fermat’s Last Theorem. 


Proof. We apply the STW-conjecture to the elliptic curve E from the proof of Theorem 
13.6. It is easy to compute its conductor N pg: it is equal to the product of primes divisors of 
abe Consider, as in the proof of Theorem 13.6, the representation Jg p : G + GLo(F,). 
If E is a Weil elliptic curve, the representation pp is an irreducible modular representation 
of level N and weight 1 with trivial character x. Let l be a prime divisor of Ng. We know 
that p|1,(A pg) if l 4 2. This implies that the representation Jg p is finite at l. When | 4 p 
this means that the restriction of Ag p to Gal(Q;/Q;) is unramified (i.e. factors through a 
representation of the Galois group of a finite unramified extension of Q). When p = l, the 
definition is a little more technical, and we omit it. Now we apply a theorem of Mazur- 
Ribet which implies that Jg p is modular of level N/l. Here we use the assumptions that 
IIN but p?,/? {Ng and l Æ 1 mod p. After applying this theorem several times, we find 
that py is modular of level 2. Now we end as in the proof of Theorem 13.6 by finding 


contradiction with absence of parabolic modular form of level 1 for the group To(2). 














Theorem 13.9. (A. Wiles) An elliptic curve over Q with semi-stable reductions for each 
prime number is a Weil curve. 


Corollary 13.3. Fermat’s Last Theorem is true. 


Proof. Observe that the elliptic curve Æ used in the proof of theorem 13.8 has semi-stable 
reductions at each prime p. 














169 


Exercises 


13.1 Let E be an elliptic curve over a field K. Define the group law on the set of E(k) 
of points of E with coordinates in K as follows. View a point P as a divisor of degree 
1. Assume that E(k) 4 Ø. Fix a point 0 € E(k). For any two points P, Q the space 
L(P +Q — 0) is of dimension 1 over K (the Riemann-Roch Theorem). Thus there exists 
a unique postive divisor of degree 1 linearly equivalent to P + Q — 0. This divisor is 
denoted by P & Q and is called the sum of the points P and Q. 


(i) Show that the the binary law of composition on ÆE(K) defined by P © Q is a 
commutative group. 


(ii) Show that, when K = C, the group law agrees with the group law on the complex 


torus E(C). 
13.2 Let E be an elliptic curve over an algebraically closed field K with the group law 
defined in the previous exercise. Let fo,..., fn—1 be a basis of the space L(nO). Show 
that 


(i) the map 6: E \ {0} > P™!, P —> (fo(P),.--, fp-1(P), has the image an 
algebraic curve C of degree n. 


(ii) Let C be the closure of C in the projective space. Show that for any n-torsion point 
P there exists a hyperplane in P”~! which intersects C at one point equal to ¢(P). 


(iii) Let n = 3. Fix a line L in P? which is not a tangent to C and consider the map 
from C to L which assigns to a point x € C the intersection point of the tangent of 
C at x with L. Use the Hurwitz formula to show that C has exactly nine 3-torsion 
points if K is of characteristic 0. 


(iv) Assuming that n = 3 and E has at least 3 torsion points of order 3, show that the 
equation of C can be chosen in the Hesse form x? + y? + 2° + Axyz = 0. 


(v) Show that in the case K is of characteristic 3, there are at most 3 points of order 3 
on E. 


13.3 Let x be a Dirichlet character modulo m. Define the Dirichlet series Lm(s; X) = 
YOz X(n)n-*. Show that 


(i) Lm(s;x) is absolutely convergent for res > 0 and admits an infinite product ex- 
pansion 
Lals; x) = [C - xpp). 
piim 
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(ii) Show that Lm(s; X) admits a holomorphic extension to the entire complex plane 
which satisfies the functional equation 


Lm(1 — 8X) = Lin(8, x)(m/2m)T(s)(e%/? + x(-1)e7™" GO), 


where G(x) is the Gauss sum of x. 


Index 


algebraic variety, 26 divisor, 97 
divisor of a function, 98 


Abels Theorem, I divisor of a meromorphic differential, 99 


absolute invariant, 94 doubling identity, 47, 127 
addition formulae, 37 

analytic space, 19 Eisenstein form, 71 
anharmonic, 21 elliptic curve, 19 
automorphic form, 61 modular, 167 


binary code, 89 ordinary, 162 


Weil, 167 
canonical class, 98 elliptic curve over Q, 155 
class, 9 elliptic function, 28, 36 
class of quadratic forms, 3 elliptic integral, 70 
complex manifold, 16 elliptic point, 102 
complex multiplication, 10, 124 Epstein zeta function, 149 
complex structure, 16 equivalent, 3 
complex torus, 19, 22 equivalent theta factor, 28 
conductor, 159 Euler identity, 46 
correspondence, 129 
cross-ratio. 111 finite correspondence, 129 
cusp, 62 Fricke involution, 125 
cusp form, 63 Frobenius element, 164 
cyclic isogeny, 123 Frobenius map, 162 
fuchsian group, 97 

Dedekind 7)-function, 44 full modular group, 64 
degree of divisor, 98 fundamental domain, 9 
degree of the meromorphic function, 101 fundamental parallelogram, 14 
Dirichlet character, 159 

primitive, 159 Gauss identity, 47 
Dirichlet series, 143 Gauss sum, 160 
discrete subgroup, 93 Gauss’ transformation formulas, 55 
discriminant, 64, 89, 156 Gaussian sums, 47 


171 


172 


genus, 99 

geometric space, 15 

Gleason’s Theorem, 90 

group of classes of divisors, 98 


harmonic, 21 
Hauptfunction, 126 

Heat equation, 37 

Hecke algebra, 139 
Hesse equation, 35, 169 
holomorphic action, 17 
holomorphic at a cusp, 63 
holomorphic map, 16 
Hurwitz formula, 101 


index, 63 

integral, 87 

integral quadratic form, 9 
isogeny, 123 


Jacobi cosine function, 116 
Jacobi form, 76 

Jacobi identity, 47 

Jacobi sine function, 116 

Jacobi theta functions, 31 

Jacobi triple product identity, 47 
Jacobi’s identity, 113 

Jacobian variety, 166 


k-gonal numbers, 46 

Kortweg-de Vries equation , 75 
Kronecker Theorem, 124 
Kronecker’s congruence relation, 121 


l-adic representation, 163 
L-function, 157 
lambda-function, 115 
Landen’s transformation formulas, 55 
lattice, 1 
dual, 89 
integral, 84 
Leech lattice, 86 


INDEX 


Legendre equation, 112 
Legendre-Weierstrass relation, 57 
linear code 
doubly even code, 89 
self-dual, 89 
linearly equivalent, 98 
local parameter, 97 


MacDonald identity, 46 
Mac Williams identity, 90 
Mellin transform, 145 
meromorphic at a cusp, 63 
meromorphic differential, 98 
meromorphic function, 26, 97 
minimal Weierstrass equation, 156 
modular curve, 97 
modular equation, 122 
modular figure, 7 
modular form, 63 

Dirichlet series associated to, 143 
modular function, 63 
modular representation, 165 

odd, 165 
Moebius transformation, 6 
Monster group, 94 


newform, 153 
normal elliptic curve, 77 
normalized eigenform, 165 


orbitfold, 19 

order, 80, 97 

order of isogeny, 123 
order of pole, 97 
order of zero, 97 


parabolic form, 63 

Petersson conjecture, 150 
Petersson inner product, 137 
primitive quadratic form, 9 
principal congruence subgroup, 64 


INDEX 


principal divisor, 98 
properly equivalent, 6 


Ramanujan function, 84 
Ramanujan’s Conjecture, 150 
rational function, 26 
reduced, 5 
reduction 

additive, 156 

bad, 156 

good, 156 

multiplicative, 156 
represents, 3 
Riemann surface, 16 
Riemann theta function, 27 
Riemann zeta function, 71 


stable reduction, 165 
structure sheaf, 15 


Tate module, 163, 167 

theta factor, 28 

theta function of a lattice, 84 

theta functions with rational characteris- 
tics, 30 

theta series of the lattice, 76 

Thomae’s Formulae, | 14 

type of isogeny, 123 


unimodular, 85 
upper half-plane, 5 


weak meromorphic modular form, 61 
weak modular form, 61 

Weber functions, 45 

Weierstrass o-function, 57 
Weierstrass ¢-function, 75 
Weierstrass cubic equation, 67 
Weierstrass function, 69 

weight, 61, 89 

weight enumerator polynomial, 90 
Weil elliptic curve, 167 


173 


